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Starting with the relativistic Boltzmann transport equation, basic equations for relativistic magneto- 
hydrodynamics of perfect and imperfect gases are derived. Relativistic magnetohydrodynamical 
generalizations of Kelvin’s circulation theorem, Helmbholtz’s vortex theorem and Rankine-Hugoniot’s 
shock relations for the simple gas are also given. 


$1. Introduction 


In the recent striking progress in plasma physics, it is sometimes required to treat 
charged particle streams with very high speeds or with very strong magnetic fields. In 
this paper we will give a relativistic generalization of magnetohydrodynamics throughout 
in a covariant manner. 

Relativistic generalizations of ordinary hydrodynamics were performed by Eckart,” 
Taub” and Synge.” By combining these methods we will take here a method which is 
completely covariant and convenient for a physical interpretation. Though relativistic 
invariance of the distribution function has been discussed by several authors, a direct 
covariant treatment of the relativistic Boltzmann equation is attempted in § 2. Starting 
with this equation, relativistic mean value equations are given in § 3 particularly for a 
simple fluid with the method used by Eckart and Taub. Basic equations describing a 
fully ionized fluid and a partially ionized fluid are given in § 4 and § 5 respectively. 

The relativistic generalization of Kelvin’s circulation theorem of ordinary hydrodynamics 
was performed by Frankl,” and we have been told recently that a generalization of the 
vortex theorem was also performed by G. Iwata for magnetohydrodynamics with the 
method of Synge. Since the Lorentz force is not a conservative force, the ordinary 
circulation and vortex theorems do not hold. But a conservation theorem holds with 
respect to a certain quantity generalized from the ordinary one. The constants appearing 
in the relativistic magnetohydrodynamical generalizations of circulation and vortex theorems 
are not formal four-dimensional generalizations of the ordinary expressions of the circula- 
tion and the vorticity. This will be treated in § 6. 

Generalizations of Rankine-Hugoniot’s shock relations are performed by Taub”? for 
the relativistic case and by de Hoffmann and Teller” for the magnetohydrodynamical case. 
In §7 shock conditions in a one-dimensional flow of a simple fluid in a uniform electro- 
magnetic field are given in a somewhat general form by using the method of de Hoffmann 
and Teller. But generalizations of the vortex theorem and the shock relations to the 


nonsimple gases such as a neutral gas plasma are not yet accomplished. 
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§2. The relativistic Boltzmann equation 


Let us consider the relativistic transport phenomena in the Minkowski space with 


the coordinates 


Lak, Cy, Pg aa 
and the metric 
gGag=9"=—-g"*—1, te | (a#f). (2-1) 
We use Latin letters k, /, --- for the space indices 1, 2, 3, Greek letters a, B, --- for 


the space-time indices 1, 2, 3, 4 and the usual summation convention with respect to the 


doubled indices. 
The four-velocity is defined by 


@=(1——)-“"c'dt /de, tH Py, 
B?=c~*((dx' /dt)? + (dx /dt)? + (dx*/dt)*) (2-2) 
and they satisfy the identical relation 
ut, =—1. (2-3) 
The direction of the velocity u* means the direction of the particle path in the four 


dimensional world and a component of a vector of time like length dz in this direction 


is u“dz, whose squared length is indeed given by (2-3) as 
(u* dz) (u,dz) = —d2. (2-4) 


Hereafter we use ct as a parameter denoting the invariant length along the world line or 


the proper time. Using the velocity vector we can introduce a symmetric tensor by 
ag ta (2-5) 

which is orthogonal to the direction of the particle path, for 
u, s =u, 9™ + u, ut = 0. (2-6) 


So it can be interpreted as lying in the proper space direction. 

On account of the restriction (2-3), the velocity space in the relativistic theory has 
a peculiar structure. It is a three dimensional subspace restricted by (2-3) in the four 
dimensional space with the Minkowskian structure, having the invariant volume element 


dU=du' du? du®/ut= (1+ bu)" dS u : (2-7) 
where 


d* u=du' du? du’. (2-8) 


As discussed in detail by Synge and others, the concept of the number of particles 
occupying the invariant volume element fdU is relativistically invariant. So the distribu- 
tion function f(x*, u*) has an invariant meaning. Since the equations of motion for a 


particle with rest mass my and charge e in an external electromagnetic field F,,=0, A, 


—0,A, is 
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cmy:du*/de =F", F*=eu,F“, (2-9) 
1/c-dx*/dr=u", | 
the rate of change of f during dt=c"'u‘dz i 
{f+ cutdt, u*+c' my F%*dr) —f(x*, u*)} /dt 
= (cu*- Of /Ox* + co" my | F*- 0 f/Ou*) -c/u'. (2-10) 


If the rate of change of f due to collisions is denoted by 4}. f, the relativistic form of 
Boltzmann’s transport equation is 


CUO /Ox Ace tg RAO f/ Ona cw a fe Af (2-11) 


m4 2 f * nae 
u' A’, is the invariant collision operator. 


where J pore 
Now let us show that this relativistic equation is equivalent to the ordinary Boltzmann 
equation. Since the number of independent variables is three, we can regard u’ as the 
independent variables and u* as a function of them. In this case the distribution function 
can be expressed as f*(x*, u’) =f(x*, u*). Since u* is a dependent variable, 
pi OF, w!) u') _ Ft (A as Ou! f(x", u*) ) 
Out Ou' ous 

=F*.0f/Ou'+F*-u'/u'- 0 f/ou' 

=F*.0f/0u'+F*.0 f/du' 

SOs Ou". (2-12) 


where we used the well-known identical relation among the components of the four-force 
rg EAN ; (2-13) 
Therefore the equation (2-11) reduces to the familiar form 
cu*- 0 f* /Ox* 4+ my Ft Of * /Ou' = ow f*. (2-14) 


Thus we have verified that (2:14) is equivalent to (2:11). This is also a verifi- 
cation of the relativistic invariance of the ordinary form (2-14), though it is not written 
in a completely covariant form. Since only independent variables appear in the equation 
(2-14), it is sometimes more convenient particularly for integral calculations and we will 
use it in some cases hereafter. If we use the asterisk generally in the same meaning as 
above introduced for f, the relation (2-12) holds for any function ¢ (x*, u*) =y* (x, u’). 


F*.09* /Ou' =F*-0y/Ou". (2-15) 


§3. Derivation of equations for a simple gas in an external 
electromagnetic field 


As is well known, equations for averaged quantities can be derived from the Boltzmann 
equation. But the method used for the ordinary hydrodynamical equation can be applied 
directly only for the case of the force independent of velocities. Though the magnetic 
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force depends upon the velocity, it has been shown that it can be applied also to an 
external magnetic force in the nonrelativistic theory. We can show that this holds also 
for the relativistic case. ; 

Since the distribution function f is assumed as a well integrable function, for a 


3 as = 
function 9(x%, u*) and the external Lorentz force F*(x*, u ) =u,F* (x) we can easily 


prove the relations 


aN aby Als | “du—| «OP gy (3-1) 
| ae Uae IPE if Sa 
| ors SL au=— | pre? dU. (3-2) 
Ou Ou* 
The relation (3-2) is verified as follows, 
ete Ca! ( peeaiew Aeaaeeeslin ye 
ie! hs = Mr IE. eax: 
le Ou’ (1+u'u,)'" \f Ou’ (9 (1+4'u,)'” 
, Oy* d*u \ 3-2 Fe 
ee Wy he — p == 4 sae 
\f Ou’ (1+u‘u,)'” i¢ Out aes 
=— [per au, (3-3) 
Oui 
because 
u,F™ (x) 


(3-4) 


Ss (* F™(@) 


\-2 se 
Ou’ \ (1+ u‘u,)'” Ou’ \ (1+u'u,)'” : 


In the usual case," when the function means a conserved quantity in collisions 
q 


the collision term in the Boltzmann equation vanishes : 
| GA f dU=0. G3 , 5) 


Assuming that this holds also in the relativistic case, for such a function © we can derive 
the following mean value equation, 
pal | fepusdu— | fe 9 (gu eee | fertims" Ee: a dU=0, (3-6) 
xen s Ox* Ou a 
by multiplying g by the Boltzmann equation, integrating over all the velocity space and 
using the relations (3-1) and (3-2). 


Since the rest mass is a collision invariant, setting @©=m, in (3-6) we have 
8 /Ox*. | Frmutdu=o. (3-7) 


In order to express this equation in terms of quantities of the mean flow, we first consider 
the mean flow velocity. Though the transformation character of the four velocity is of 
a vector, its geometrical meaning is the direction cosine of the world line and it is a 


dimensionless quantity. So we may define the mean flow velocity as the direction cosine 
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of the mean world line, so to say, of the particle world lines, namely of a line drawn 
in the direction in which the particle world lines are the most dense. Thus it can be 
defined as the direction cosine of such a vector whose components are the mean values 
of particle velocity components u#*, where the bar stands for the mean value 


A=1/n-| fadU (3-8) 
and 


n=|fau G9) 


is the invariant mean particle number in unit proper volume. If we denote the length 
and components of the line element of the world line of the mean flow by (dz) and 


(dx*) respectively, a proportional relation 
(dx?) /it = (de)// — it" tg (3-10) 
holds and the mean flow velocity v* becomes 
v= (dx") (dry =i" /\/ — i it, (3-11) 
and it satisfies 
U V,= —1. (3-12) 
Now if we define the effective flow density per unit proper volume by 
p=nimVY — i ig, (3-13) 
the equation (3-7) becomes 
0 (pu*) /Ox*=0. (3-14) 


This is the continuity equation of flow. Thus the flow behaves itself as a fluid whose 
density is different from the sum of the particle rest masses. The effective density occurs 


from the thermal motion, since setting 

uX=a% + u/% (3-15) 
we can easily show 
a= — (+0 ul). (3-16) 
Since the four-momentum is a collision invariant, setting G=m)u° in (3-6) we have 

8 /Ox*- | fing wt dU = | frau. (3-17) 
This is the equation of motion, the balance of forces and the conservation of energy 
OT 0x8 ak, (3-18) 


where 


T | fine wd (3-19) 


6 K. Goto 


is the mean energy momentum tensor, 
jo= [fend =m pr, (3-20) 
is the mean four-current and 
jeF "= | feu," dU= \fFtau (3-21) 


is the mean Lorentz force acting on unit proper volume of the fluid. 
Now let us consider the energy momentum tensor and express it with the flow 


velocity v*. By using the mean velocity components #* it can be written as follows. 
TP mycnit i + | fine (uta) (u’—a*) dU. (3-22) 


The integral in the right-hand side is to be calculated after f is determined by solving 
the Boltzmann equation. The exact solution of it is almost impossible unless the force 
is absent. In the ordinary hydrodynamical case, where the treatment is not relativistic 
and the electromagnetic force does not appear, the zero approximation for f, which is the 
Maxwell distribution function, yields the ordinary hydrodynamical equation for the friction- 
less fluid as the equation of the mean motion. The first approximation for f yields the 
Navier-Stokes equation, and the second approximation brings Burnett’s equation.” Also 
in the case of relativistic magnetohydrodynamics, it is desirable to develop a similar 
method, which is called the Enskog-Chapman method. But as this is too complicated, 
we will now use the approximate assumptions following Eckart.” 
The zero approximation is 


TG =e + po, (3-23) 


where 


w=nm, Cc (—i* it) =nm, e (1 +4" u,’) (3-24) 


is the energy density in a system at rest with respect to an element of the fluid, with 


the internal energy per unit proper volume nm cu’*u,’, and 
a — 9g + oof (3-25) 


is the tensor indicating the proper space direction of the mean flow. This satisfies the 
relation 


Ty = —w+ 3p * (3-26) 


and its alternative form 


T = (w+p)v*e*+ py™ ~ (3-27) 


is the well-known classical expression for the relativistic fluid without friction. 
Next, as the first approximation for T**, we will take . 


TH=TH+SE, (3-28) 
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where /# is the coefficient of viscosity and pS dG is the viscous stress tensor defined’) by 
G =o o"? (Ov, /Ox° + Ov5/Ox") —2 0% ot. Oy,./Ox®, (3-29) 


which satisfies the conditions 


Ve 5G) =0, (3-30) 
(ja = 0. (3-31) 


The condition (3-30) means that the viscous tensor is in the direction orthogonal to 
the flow velocity. The latter condition (3-31) means that the relation (3-26) holds 
also for the first approximation or that the invariant static pressure is already taken into 
account in T (}. 


In the non-relativistic approximation o”° is reduced as 
o”>0, oF 50%, (3-32) 
so that the viscous tensor becomes the well-known Stokes’ form 
PS aj) > 2: (Ov*/Ox, + Ov! /Ox,) —% po4. du" /Ox*. (3-33) 


Lastly adding the equation of state to these equations we have a system of basic 


equations of flow. 


§ 4. Equations for the fully ionized fluid 


The fully ionized fluid is composed of ions and electrons. We consider the simplest 
case where the ions are of only one sort. Moreover we treat here the ideal case involving 
only the elastic collisions and assume that inelastic processes such as excitations, ionizations, 
recombinations and nuclear reactions do not occur. We use suffices (i) and (e) to denote 
quantities for ions and electrons respectively. 

In the assumed case both ion and electron rest masses are collision invariants, so 


that the continuity equations hold for ions and electrons independently. 


(P@ UH) /Ox*=0, (4-1) 
O (P(e) Vey) /Ox* = 0. (4-2) 
The flow equations for ions and electrons can be derived in the same way as in § 3. 
OT Gf /Ox" =juyyF "+ APS, (4-3) 
OT Oke fie Ft APA (4-4) 
where 
J@= [Fees Uj Uy = Cy MHP HUG (4-5) 
jo=— |fe 15 Ue) = —e MES Pe VO (4-6) 


are the four-currents of ions and electrons and 
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By = a ld * 3 3 
APS = | moo Aon fio IU =e | mooted Noon f5 TU (4-7) 


f 8 a wl 8 4 * f3 ° 
APS = | maou co fie) We = | moon co fed “ (4 8) 


are the total changes of four-momenta of ions and electrons by collisions, respectively. 
Since a collision between identical particles does not change the resultant momentum of 
particles, only collisions with other particles yield net momentum changes. Hence only 
a part of 4P5 gained by the collision with electrons, say IP, will survive, and also a 
part of 4P.5 gained by the collision with ions, 4P,,5, will survive. 

If;we have two relations among 4P,5, IP.5 and flow variables, we can rewrite the 
flow equations (4-3) and (4-4) in different forms without JP,,,, AP,5, after elimination 


of them. By the conservation law of the total energy and momentum, the relation 


Y 2 4p e (4-9) 


i (e)> 
or equivalently 


4P.g=—4P,<, (4-10) 


holds. This is an exact relation, arising from the conservation law of dynamics, but as 
the second relation we must set a somewhat approximate assumption, Since the velocity 
change of one of two colliding particles is proportional to the velocity difference between 


the two particles, we assume 


4 Ce ne ak « © ae —i —i} 
AP = Cup (a (ec) t «) > Ca [Cl PE lw ™ooM~po| (4-11) 


€ 


where 7," may depend on several conditions such as field strength and temperature but 
to a first approximation it may be taken as a constant 7/,%. Since in the non-relativistic 
Stationary case the resultant equation reduces to Ohm’s law with 7 as the electric resis- 
tivity, 7 can be interpreted as an extension of electric resistivity. 
Now using the two relations (4-10) and (4-11), we can eliminate JP,3 and JP.3 
and obtain the equations of flow 
O(TH+T eS) 
“vet G Oe or . RY 
ane ~~ Gar +jerx) sae (4-12) 
x 
O(meyoT Gs — Mayo T 3) 
ALICIA ENC) EN CNC . ss . 8 : 
Ox" = (Me oj x— Mejor) F™ + (moot Mayo) CrNa (Ua — Mo) + 
(4-13) 


Thus we have obtained the basic equations for fully ionized fluid in terms of ion 
and electron variables. In order to describe such a composed plasma as if it were a 


single unified fluid we must introduce total flow variables describing the plasma flow as 
a whole, 


total mass density : P=PwomtPe> (4-14a) 


total mass velocity: o*=p7! (PMUBHCHUMS)s (4-14b) 
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total charge density: o =e) m@o Pu) — emo Pre» (4-14c) 
total current density: j*=e( mio Puy VS —EMe0 Pw VE: (4-14d) 


Inversely the ion and the electron variables can be written with the total flow variables 


Pa=(eetmeo)/(e+mema ew), (4-15a) 
Pe = (e@ P— mao) / (ew +ma mee); (4-15b) 
VG = (ep +m j%)/(EP+meF) (F5=eHMBoPaVA), (4-15c) 
Ve = (Cy PL* MGI") / (EP —MHe)  (FS=—EMGoP@UE)- (4-15d) 


It is not difficult to write the basic equations, (4-1), (4-2), (4-12) and (4-13) 
in terms of new variables ~, o, v* and j* by performing the variable transformations 
(4-15). But the resultant equations are not so simple in general, and this seems to 
show that it is neither natural nor advantageous to describe two general interacting, non- 
linear, relativistic flows as a unified flow. Such a picture of the total flow seems to be 
successful in the case of a slow or moderate speed plasma. 

However we will give here the equations of total flow for the most important case 
of a neutral frictionless plasma of ions and electrons with e,)= +e, Ce) = —€, Mi) > Mes 
a= 0. In this case (4-15) reduces to 


Pa=P> PoO=MeooMaoP» (4-16a) 
VQ=uv+ Meyof/€P > (4 , 16b) 
Ve = —U moj / ep - (4 : 16c) 


Since 
Pam VG =P + mee j* 
P(e) Vey = — Mee) MGy PU +(e Cj * 
the equations of continuity, (4-1) and (4-2), become 
9 (ep0* +m j*) /Bx*=0 
0 (—epu* +m j*) /Ax*=0. 
or equivalently 
0j*/Ox*=0 (ants) 
0 (pv) /Ox*=0. (4-18) 


These are the conservation equations of charge- and mass-flows. 


Since 
ve Gav Et (jE + Pet) meo/e+i7" (mon/e0) 
tog =o — Gv +7' v7) meyo/ept j*j* (meyo/e?)” 


joTIO=)s 
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the first equation of flow, (4-1 2), becomes 
OT JOP Se (4-19) 
where 
T® =A +B jo +pPo) + CP +py” (4-20) 
with 
A=W tW os B= (meV w—mMoW w) (ef, C= (meoW w+ mnoW wo) / Cer)’ 
Ww =™y+Po, Vo=“otPor P=PotPo- (4-21) 
Since 
Meo J i — Mayo JQ = CPU" Mw Js 
(met mayo) Cer (UHV) = PGs 
the second equation of flow (4-13) becomes 
OR*'/Ox* = (epv,+meoj~)F™ + epyj", (4-22) 
where 
R®* = Bepr*v + Ceo (j7 + j* v*) +777? (moo V wo — moo ww) /(ep)® 
+ (Me Pi—™woPo) I”: (4-23) 


Thus we have obtained basic equations (4-17), (4:18), (4-19) and (4-22) for the total 
flow, but this is not necessarily convenient, but sometimes it is preferable to use (4-3) 
and (4-4) with (4-11) or to use (4-19) and one of (4-3) and (4-4). Of course, 
to complete the system of basic equations we must add the equation of state. 


$5. Equations for the partially ionized fluid 


The partially ionized fluid is composed of neutral atoms, positive ions and negative 
electrons. We consider the simple case where both atoms and ions are of only one sort 
and there occurs only elastic collisions. We use the suffices (a), (4), (e) to denote the 
quantities for atoms, ions and electrons, respectively. 

Equations for such a plasma can be obtained in a similar way to that used in § 4. 


The continuity equations for the mass flow hold for atoms, ions and electrons indepen- 
dently : 


9 (Pa) Vay) /Ox"=0, (5-1) 
9 (Pw Vp) /Ox*=0, (5-2) 
O (Oe) Ute) /Ox*=0. (5-3) 


The flow equations for atoms, ions and electrons can be 


§ 3 and § 4, 


derived in the same way as in 
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OT G5 /Ox* = AP Es a APES F (5-4) 
OT & /Ox* fay F + APE.) + 4P&,, (535) 
OT 5 /Ox" = jy F* + 4PE) + APE, (5-6) 


where for instance 4P,,,. is the net momentum change of atoms gained by the collision 


with ions, and the other quantities have similar meanings. The relations similar to (4-9) 
are 


APG) = — AP Gay, AP ey = — APE, AP) = — AP Qy. (5-7) 
As the relations similar to (4-11) we assume 
AP &, Coe Np” (%6 —v@) > (5 -8a) 
AP oa) = Cea 3” (Vee) — Via) » Coa = © Pee) Pca Mey Mayo (5 -8b) 
AP Gy = Cia * (UV —Vd) > Ca= EC Pw Pray MHOMENO- (5 8c) 


' tr : . oth a Are . . 
The terms 7;", 7;°, 73;/* have meanings of particle resistivity, being proportional to the 


reciprocal collision time intervals, and to a first approximation they may be constants 


7 =196"s = 7 Is 6" = 9" Is" OF?) 
Adding three equations, (5-4), (5-5) and (5-6), we have 
O(TSt+THS+T ES) /0°= (jar tier F™. (5-10) 


This is the most convenient equation to describe the plasma flow as a whole. As the 
basic equations of flow we can take (5-10) and two of (5-4), (5-5) and (5-6) with 
(5-8) and (5-9). Such three equations of flow and the continuity equations, (5-1), 
(5-2) and (5-3), are the basic equations for the partially ionized fluid. Of course for 
the complete system of the basic equations we must add the equation of state. 

In order to obtain the picture of a single unified fluid, we can introduce the total 
flow variables p, o, v% and j* in the same way as in §4, but the number of new 
variables is not sufficient to eliminate atom, ion and electron variables O@), Vi)» P> 
v%5, M%@) and vt). This shows that the general partially ionized plasma should be treated 


as a composed medium and not as a simple unified fluid. 


§6. Generalized circulation and vortex theorems for isentropic flows 
of simple perfect fluids 


Basic equations for a simple fluid of a perfect gas with charge e are the equation 


of continuity 


8 (pw) /Ax*=0, (6-1) 
the equation of motion 


8 (Jute? + py) /Bx*= ems pveF*, (6-2) 
the equation of the relativistic thermodynamics 


T dS=dJ—p™"'dp (6-3) 
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and the equation of state, which we assume to be barotropic, where 
J= (+?) /p (6-4) 
and T is the temperature, J the entropy and J the heat content per unit rest mass. 


In order to derive the conservation theorems of generalized circulation and vorticity, 


we shall rearrange the basic equations. By using (6-1) we can write (6-2) in the form 
gee (Jv) pre iae he +- Cy Fr, (6-5) 
Oe p > Cin amy 


Further by means of ee this becomes 


a] . 
Fe fy ae v, F*. 6-6 
tg J) (he = a pees (6-6) 
By differentiating (3-12) we can show 
re) a] 
Ramee fs eee es 6-7 
ve Ox, Ue) Ox, (4:7) 
From (6-6) and oe we have 
Ox my 
or 
re) e as 
Jey ay, = oe a P ‘ 
: | Ox, My ) ») J axe (6 9) 


This is the generalization of the vorticity equations of classical hydrodynamics. 
Now let us consider an isentropic flow. For such a flow (6-6) reduces to 


a) —— 2) 4 wey Ot 


mM Ox, 8 


(6-10) 


When a closed curve C, whose line element is dx*, is given in the four dimensional 


world, we define the generalized circulation 7” with respect to a curve C 


= | Jvatems" A.) Ox, (6-11) 


“ 
C 


Then the variation of J’ becomes 


aa 


(J vat emy' Ay) Ox* + (Jvg+ ems" A,) - hal = 


I 


be 
IG 


[—98J+3J0 (vev*) + emy'v,04* + ems" A, Ov") 


Fa tem tn Ge Ot (Tact Ay) de" 
x 


Xe 
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s 8 Jt em" Av] 
Cc 

==} (6-12) 
Thus we have generalized Kelvin’s theorem of circulation of classical hydrodynamics in 
the case of relativistic magnetohydrodynamics. It should be noticed that in our case the 
constant in the theorem is not a formal four-dimensional generalization of the expression 
of classical hydrodynamics, but the integral (6-11) remains constant with respect to an 
arbitrary closed curve in the Minkowski space time. 

From the above obtained theorem we can derive the theorem of vorticity in the 

usual way. By the generalized Stockes’ integral theorem in the relativity theory,” the 
integral (6-11) can be written as a surface integral over the surface S enclosed by a 


given closed curve C 
P= | lwo, (6-13) 
S 


where 


w= (Ju + em At) 2 (Jur tems" A) (6-14) 
Ox, Ox, 
and 0S,, is the surface element. If we write the invariant form in the integral symbol 


as 


»*3S,,—9, 08, (OS)2=ASz,-dS* 


it becomes the familiar form 


r= |{»,09. (6-15) 
s 

Thus the circulation theorem (6-12) can be converted into the form of Helmholtz’s 
theorem of vortex, namely the strength of generalized vorticity w,0S remains invariant 
during the motion of the fluid and the generalized vorticity can be neither created nor 
destroyed. This is the generalization of the classical vorticity theorem. It should be 
noticed that the generalized vorticity in the theorem is not a formal four-dimensional 
generalization of the expression of classical hydrodynamics, but we must generalize it in 
the form (6-14). This shows, inversely, that the formal generalization 


( Ov Ov* ) N 
— OSes 
Ox, Ox, 


does not remain invariant and can be created and destroyed even in the field free case 


Aa, 


$7. General shock conditions for one dimensional flow of a simple 
fluid in a uniform electromagnetic field 


Since, as is well known, the energy momentum tensor of the electromagnetic field 
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FOR +A go Pek (7-1) 
satisfies 
OT anal OX = —j_-F (7-2) 
the flow equation of a simple fluid can be written as 
O(Tiiiw +T fara) /Ox*=0, (7-3) 
where 
Tiow= (w+p)rr+p™, (7-4) 


neglecting the viscous terms. 

Let us consider a one dimensional flow in a uniform electromagnetic field. We 
assume that the fluid flows in x, direction (v,%0, 7y=v,=0), the external field is 
constant and a shock wave, whose front is in the plane perpendicular to the x, axis, is 
travelling in x, direction with a uniform velocity U. Now let us transform the coordinate 
system so as to be at rest relative to the shock front and denote the quantities referred 
to this system with prime. Since the shock front is at rest in the system, its position 
is a point x), and if we denote quantities ahead of the shock by a suffix (1) and those 
behind the shock by (2), the fluid is flowing with a velocity v4, into the shock and 
then into the back region of the shock with a velocity vj). As all the quantities depend 
only on x,’, the equation (7-3) reduces to 


9 (Tow + T peta) /Ox,'=0. (7-5) 


Though it is usual to assume, neglecting the viscous terms, that at the shock front 
discontinuities of fluid parameters occur, we can show that this is only an approximation 
to a complicated continuous flow when the viscosity is taken into account. Thus the 
equation (7-5) should hold across the shock front approximately and we have 


Ta +Ti2a=const. in flow. (7-6) 
or 
1 Q t nil 9° 
(mq) + Pay) (way) + pay + Theraciy = (Mp) +p) (ey) c + Pe) = Thera > (7-7) 
mown 4 
(ma) + pay) Vay Vay + Theiaay = (%) +P) Ve) Ve) ein Thea . (7-8) 


This is a generalization of Rankine-Hugoniot’s relations. 
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On the Orbital Approach to the Many-Electron Problem in Molecule 
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The lowest *2,,* and 13,,* states of the hydrogen molecule, the interaction energy between two 
normal helium atoms and the °B;,, and 'B,,, states of the ethylene molecule are treated by the method 
of the molecular orbitals in the single configuration approximation. The results are encouraging and 
the reason for this success is that we have used molecular orbitals which are a little more flexible 
than the conventional LCAO MO’s. It is confirmed that the failure of the conventional ASMO 
LCAO approximation does not necessarily imply the break-down of the useful concept of “ molecular 
orbital ”’ itself. 


§ 1. Introduction 


It is convenient if we can describe effectively an electronic state of a molecule by 
one single electronic configuration. In this approximation each electron has its own orbital 
and the physical picture is simple and appeals to our naive imagination. This approximate 
concept has worked well in qualitative discussions but, if we try to put the discussions 
on more quantitative bases by specifying the forms of the orbitals in conventional manner, 
we often find rather discouraging results. In recent years several authors have stressed 
the necessity of discarding this attractive concept of the single electronic configuration. 
Coulson et al.” stressed the significance of the so-called configuration interaction and the 
accompanied difficulties of calculation. In their paper we find, for instance, the following 
sentence: ‘“‘ Thus neither the description of a state by one single configuration wave 
function, nor of a spectral transition as a one-electron phenomenon, is sufficiently close 
to the physical facts to give a satisfactory working picture”. Moffitt’s attitude” is more 
radical in discrediting the conventional method of ASMO LCAO with CI and proposing 
a new prescription of calculation, the method of atoms in molecules. Coulson et al. are 
absolutely right in remarking that, as far as we use the conventional ASMO LCAO 
method, we should proceed beyond the single configuration approximation by superposing 
necessary number of configurations. Also, as far as we are not able to get satisfactory 
numerical results from the conventional non-empirical method within reasonable labor, we 
should estimate properly the merits of the method of atoms in molecules and other 
related semi-empirical methods. Here is, however, the problem of the physical interpreta- 
tion. It appears that the concept of the single configuration has been faded unreasonably 
by these and other influential authors. This reminds us of the history of the develop- 
ment of the theory of nucleus. In the earlier stage, the people attempted to proceed 
with the independent particle model but in the meanwhile there appeared the influential 
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wotk of N. Bohr in 1936, which provided the thereafter prevailing picture of the nucleon 
correlation in the nucleus. In 1949, however, G. Mayer and others succeeded to restore 
the honour of the independent particle model in a remarkable manner and older physical 
pictures of the nucleus had to be reexamined from the new standpoint. Of course, the 
correlation of nucleons inside nuclei may have several distinct features from that of electrons 
in molecules and the analogy here mentioned might not be suitable one. But if we can 
show, in the many-electron problem of molecules, that the orbital approach reveals far 
reaching validity than usually conceived, it is of interest by itself and a new light might 
be thrown on the physical interpretation 4 /a Moffitt of electron correlation in molecules. 

The following three sections are devoted to the description of calculations on the 
H, molecule (§ 2), the He-He interaction (§3) and the ethylene molecule (§ 4), 
performed within the single configuration approximation. The results are encouraging. 
It seems that we can expect to obtain good results in many cases by using the molecular 
orbitals which are a little more flexible than the usual LCAOQ MO’s. In the last section 
(§5) we shall discuss at some length the results of the calculations in the preceding 


sections. 


§2. Hydrogen molecule 


The 1so 2po°*X i and 'S'} states of the hydrogen molecule are treated by the 


u 


method of the molecular orbital within the single configuration approximation. The problem 


is closely related to that of the °S'{ and *S'> states of the oxygen molecule and of the 


°B,, and *B,, states of the ethylene molecule, which were treated successfully by Mofhtt’s 


new method, the method of atoms in molecules. 


(a) LCAO MO approximation 
We begin with the following two molecular orbitals of LCAO type, 


P= Ny, (aS) 3 Fy, (1) 
R= Nu (Sa! — 8). 5 Fes (2) 
Here N, and N,, are the normalization constants and 
Sq (Z*/m)” exp(—LZiq), = (Z2/x)” exp(—Zr,), 
fel = (Z"/n)" exp(—Z!r), 5 = (Z"/2)"* exp(—Z!n). 


. 3 V+ 1 SY + 
Then the wave functions for the lowest °3'! and 2’! states of the hydrogen molecule 
are, omitting the spin parts, 


PCR) = (1/2) {Py (1) Pu (2) — u(1) 94 (2)}5 (3) 
ve Cae) 2 (1//2 ) {@,(1) 9,,(2) +9,,(1) 9, (2) } 4 (4) 
The Hamiltonian of the system is (in the atomic units) 
H(1, 2)=(—+ 4 a ———)+(-4.4 te ) : 
ai rp eeor, BR isioont astra. % ess () 
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This expression includes electronic terms only. The orbital approach is exact if we neglect 
the last term, 1/r,., the repulsive interaction between two electrons. The expressions for 


the electronic energies are 


Ee, (ae?) =H,+H, St) ea a (6) 
E.C 22) =Hy+ Hat Jou Kus (7) 
where 
ts [ ue u 4.— 2 Zt ) (he 

a= \% ; pt) ete (8) 

ul if 1 
Hy= \e.(- 4— ay )eude, (9) 

s . 2 Up Ty 


Jou= \Je@ea (1/119) Ou 2), (2 )de.dr.: (10) 


Kou= |) Go(1) Pu(1) A / 12) Pa (2) Pu(2) dr, 4e0. (11) 
In this case the singlet-triplet separation is given as below, 


4=E, (27) —E, C27) =2Ky,. (12) 


Since the ground state of the hydrogen molecule is of the symmetry of ‘37 and we 


are concerned with the lowest *3'; and ‘2’; states we can calculate the energies of these 
two states by straightforward application of the variation method. The parameters are 
Zand Z’. It is expected that we should take Z>1 and Z’<1. Coulson and Fischer” 
chose the value of Z=1 as a compromise between these two requirements. We examine 


here three cases, Z—=Z'=1,°2=2' 71 and ZZ". 
fal) Z=eZ' Si, 


In the case of Z=Z’ the expressions of the wave functions, (3) and (4), can be 


recast in the following familiar forms : 
Dee G52) s(t) (2); (13) 
Poe ines (1) ¥ (2) —§,(1) 5 (2). (14) 


These expressions are usually interpreted in such a way that the 3X'* state reduces to 


two normal H atoms and the '3’* state to H* and H™ at the infinite separation. The 
correspondences to the case of the ®3'* and *3'> states of the oxygen molecule and also 
to the case of the °B,, and 'B,, states of the ethylene molecule become evident now. 
However, this interpretation loses its apparent meaning in the case of ZAZ’ and for the 
purpose of the present paper the original molecular orbital expressions, (3) and (4), are 
preferable. 

The calculations are performed at three values of the internuclear distance R, 1.0, 


1.5 and 2.0 au, special attention being placed on the case of R=2. Table 1 contains 
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the results of the calculations in the case of Z=Z'’=1. It seems convenient to show 


the values of the repulsive energy for the 2¥'+ state, which is given by the formula, 


E,C@S +) + (1/R) —(—1), where (1/R) is the Coulomb repulsive energy between two 
nuclei and (—1) means the sum of the energies of two hydrogen atoms in the ground 
1s-state. The atomic units* are used throughout this paper, unless the contrary is ex- 
plicitly stated. The calculated repulsive energies of the state in Keal/mole are found in 
Table 2. The values of the case of Z=Z’=1 are listed in the column (a-1) and if 
we compare these with the results of the accurate calculation of James, Coolidge and 
Present,” listed in the last column (JCP), we find that the assumption of L=ii=s1 
yields very unsatisfactory numerical results. In Present’s calculation on the ‘2; state,” 
the attention is focused mainly on the internuclear distance around R=2.4 au and un- 
fortunately it seems that we have no standard values of the energy of the state over the 
range of R=1~2 au, to check the accuracies of our calculations. Nevertheless, a provi- 
sional estimation based upon Present’s results gives the value shown below for the singlet- 


triplet separation 4 at R~2 au, which seems sufficiently accurate for our present purpose : 
A—v.14 an (Kk? as. (15) 


Accepting this value we see that the accuracy of the calculation under the restricted 


2 ¥'= 3 W'+ 


assumption of Z=Z’=1 is even worse for the ‘3’; state than for the “3 = state, because 


4d=0.30 in this case according to Table 1. The separation is grossly exaggerated as is 


the case for the *X* —3J'> 


« Separation of the oxygen molecule and for the *B,,,—'B,, 


Table 1. (Z=Z’=1) 


R E.@2.*) E.'S 4*) ~ 

1.0 — 1.295085 — 1.027282 0.267803 
Ls — 1.347656 / — 1.063634 0.284022 
2.0 — 1.346042 — 1.042683 0.303359 


Table 2. The repulsive energy of the °¥\,+ state in Keal/mole 


’ 


aunt Nie vi | (b) (HL) (ICP) 
442.5 24st 

142 119.8 

65.9 


(a-1): LCAO MO Z=2/=1 

(a-3) : LCAO MO ZZ’ 

(b) : James MO 

(HL): Hirschfelder and Linnett® 

(JCP): James, Coolidge and Present" 

t The value from the formula (12) of HL. 


* Length: aj=0.5292 x 10-8 cm (Bohr radius). 
Energy: e?/a)=27.21 eV=627.7 Keal/mole (twice the jonization energy of hydrogen). 
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separation of the ethylene molecule. To summarize, both the absolute energy values and 
. oy i® 3yVv 4 VES 
the relative positions of the lowest °Y; and “13"' states of the hydrogen molecule are 


given only very inaccurately in this first stage of approximation. 
(a2 La 2) Se 1, 


As the second stage of approximation, let us try to improve the calculation by 
varying Z, still keeping the restriction of Z=Z’. The calculation is performed at R=2 
only and the results are displayed in Table 3. It is observed that the optimum values 


Sable Sun ( A= 24-1, R—=2) 


= E,C2u*) E.C2y*) 
0.5 — 1.09752 — 0.963623 
0.75 — 1.29423 — 1.08121 
0.875 — 1.33590 — 1.07921 
1.0 — 1.34604 — 1.04268 
LD — 1.28062 — 0.874563 


of Z would lie around 0.9~1.0 for *2'; and around 0.8 for *2{; but for the present 
purpose it would be sufficient to select from Table 3 the next set of the energy values 
as the best combination : 


ECo2) == 1.34604. (Z=1) 
; (16) 
Et 2) )==—-1.08121, (Z=0.75) 
Then we find J=E,('2;) —E, ?5';) =0.26483. We see that E,(’2'5) is improved by 
about 1 eV but for E,(°2';) the older value with Z=1 happens to remain nearly the 
best one under the variation of Z, no improvement being brought about. As a whole 


the accuracy of calculation is still very poor. 
(a-3) ZZ’. 


We should like to stress here that it is our ‘‘ Leit-motiv” throughout the present 
work to give the freedoms of separate deformations to the bonding and anti-bonding 
molecular orbitals. For the LCAO-type molecular orbitals g, and ¢,, given in (1) and 
(2), it is effected by taking Z and Z’ as two separately variable parameters. A complete 
variation treatment concerning these two parameters is desirable, of course, but in the 
present calculation the values of the parameters are fixed by simple guess work, for con- 
venience. Table 4 exhibits the values of the parameters chosen primarily for the ae 
state» and later used also for the ‘3’; state. Notice that each set of Z and Z’ obeys 
the restrictions; Z>Z’, Z+Z/=2 and (1/2) (ZR—Z’R) =0.5. The last restriction 
permits us to use KAIK table? conveniently. The results of the calculation are shown 
in Table 5. Comparing this table (Z4#Z’) with Table 1 (Z=Z'=1)5" we _contem 
again that the calculation with the restriction of Z=Z'=1 yields very poor results (0.1 


ay=2.721 eV): The repulsive energies of the ag state calculated from E, C23 DINOE 
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Table 4 


Table 5. (ZAZ’) 


R E,@2u*) E.(3,*) 4 

1.0 . — 1.60162 —1,47751 0.12411 
© Sl 6168 | ~ 1.24193 0.21973 
2.0 — 1.38681 —1.11140 0.27541 


Table 5 are listed in the column (a-3) of Table 2. It is seen’ that we have succeeded 
to approach rather closely to the values of (JCP), still remaining wittias the single con- 
figuration approximation. (1 Kcal/mole=0.04335 eV) As for the ‘3’; state, however, 
it is highly desirable to improve the calculation further because J=0.2754 at R=2, 
while the actual value of the separation energy seems to be J~ 0.14. We have attempted 


to improve the calculation for ‘X'} by choosing the following set of the parameters : 


ZR=3.0 and 2’R=10, (Rez; 


This choice has an aim to reduce the degree of overlapping between the orbitals ¢, and 
¢,,. The results are 


E,(2*) = —1.30218, E,C 2+) =—1.13555, 4=0.16663. 


Summarizing the results obtained hitherto, we find the following values of E,(@S{) and 


uM 


E,('2'}) at R=2 as the best combination ; 


E,?@3;) =—1.38681 (Z=1.25, Z’=0.75), (17) 
17 
E,C2¢) =—1.13555 (Z=1.5, Z’=0.5). 


This set yields the value J=0.25126 as the singlet-triplet separation and it simply means 
that the energy of the ‘3’! state here obtained still remains too high. More careful 
choices of Z and Z’ will bring about better results but probably we could not expect 
much from such a source of refinement. 


(b) James-type MO approximation 


We use here the following molecular orbitals and tentatively call them James-type 
molecular orbitals : 


Po~ (1+ cy?) exp(—aé), (18) 
¢.~y-exp(—fs), (19) 
F=(r, +7) /R, A= (ta—7) /R, 
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As is well known, James calculated successfully the ground state energy of H,* by using 
the function of the form of ¢,. Now we have three parameters, namely a, / and c. 
Since it is easily seen that 2a and 2/ are just corresponding to ZR and Z’R respectively, 
we assign the same values to them for the sake of simplicity, as shown in Table 4. c 


is determined by the usual Ritz method. The results of the calculation are shown in 


Table 6. 


Table 6 
Rambsbdita@int PEG ut) wclbied sd eCLy*) c(Eut) 
1.0 — 1.60656 | — 1.49251 0.11405 +0.1539409 | +0.06550152 
1D — 1.46456 — 1.27081 | 0.19375 +0.3508958 | +0.09142185 
2.0 — 1.38686 — 1.15330 0.23356 +0.6111767 +0.1396027 
As for the *3'; state, the electronic energy values E,(*?3'3) in Table 6 are found 


to be almost identical with those of Table 5 (LCAO MO, ZZ’), and the existence 
of the parameter c does not seem to play any significant role. On the other hand, 
definite improvements are found in E,('S';) of Table 6 as compared with the correspond- 
ing values in Table 5. However, the fact that th values of « for ‘2’; are rather small 
seems to indicate that there would be some possibilities of further improvement by 
choosing the values of @ and ;7 more appropriately. Indeed, adopting a new set of these 
two parameters, 2a@=3 and 2/1 at R=2, just corresponding to the last case of (a), 


we find that 


E,@2~) =—1.34107, c=+0.515597, 
E,CX*) =—1.21097, c=+0.278868. 
Here is obtained a remarkable improvement for E,('2';). Thus, the best combination 


of E,@S;) and E,('3';) at R=2, obtained by using the James-type molecular orbitals, 


1S 


E,C@S*) =—1.38686, (2a=2.5, 28=1.5, c=0.6111767), on 
20 
E,CSt) =—1.21097, (2a@=3.0, 28=1.0, c=0.2788684), 


and accordingly 
4d=0.17589. (R=2). 


This is in tolerable agreement with 40.14. In summarizing the results here obtained, 
we may say that not only the singlet-triplet separation but also the absolute energy value 
of the *X* and 13f states are estimated with rather good accuracy within the single 
configuration approximation, since the value of E,(*2';) in (20) is sufficiently close to 
the actual value, as already remarked. In this connection it is of interest to remind 
that we have already obtained rather good value of ExC 23) vate R22. ECL) == 


1.20587, by the one-center expansion method” in the single configuration approximation. 
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This value is definitely better than the best value — 1.13555 of the LCAO MO approxt- 
mation in (17) and that of the James-type approximation found in Table 6. We shall 


return to this point again in the last section of the present paper. 


§ 3. He-He iateraction energy 


The calculation of the interaction energy between two normal helium atoms is one 
of the most fundamental problems in the quantum theory of molecular systems. If we 
try to tackle this problem by using the method of the molecular orbitals in the single 
configuration, it is natural to adopt one bonding orbital ¢, and one anti-bonding orbital 
¢, and allot two electrons to each of these orbitals according to the Pauli principle. 
Explicitly, the wave function for the system consisting of two normal helium atoms 


separated by finite distance can be written 
P= (1/24)"" 3) (—1)* Pal @g(1) & (1) $5 (2) 8 (2) Pu(3) (3) Pu(4)3(4)], (21) 


where P, is the operation permuting the electron coordinates, and @ and j are the usual 
spin eigenfunctions. The functions, @,, @,, @ and j3, are all supposed to be normalized 


to unity. We write the energy operator for the system as 
4 4 
H= H(i) + 3) (1/4) + 4/8) (22) 
i= isj= 


where 
FHi(é) = — (1/2) 4,— (2/tas) — (2/114) - (23) 


Here (4/R) is the Coulomb repulsive energy between two nuclei. Then the energy of 


the system is written out as 
jo \ V AY d= =2H,+2H,4+Jog+Juu+4 Jou — 2K, + (4/R), (24) 


where the expressions of H,, H,,, J and K,,, are as given before in (9), (10), (11) 
and (12), respectively, and 


Juu= \) $02) $01) (1/r2) G52) G42) desde, (25) 


Juu= | Pu(1) uC) (1/3) Gu(2) Gal 2) ded. (26) 


Just as in the calculation on the hydrogen molecule in the preceding section, various 
approximations are obtained according to the ways of specifying the forms of ¢, and ¢,. 
Grifing and Wehner™ have made calculations corresponding to the cases (a-1) and 
(a-2). Murai’ has performed a calculation on the present subject corresponding to 
the case (b) with considerable success. In the following we shall describe briefly two 
calculations carried out by the present author. The first one™ is the He-He version of 
the case (a-3) with Z54Z’ for the hydrogen molecule. In the second one™ we try 
to include the effect of polarization of the basic atomic orbitals by adding the 2pco 
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atomic orbital to the 1so atomic orbital. This idea is found in the earlier works of 
Dickinson"? on Hs and of Rosen" on H,. Griffing and Wehner’ have also made use 
of this in order to calculate the van der Waals attraction energy between two normal 
helium atoms. 

(I) LCAO MO approximation with ZZ’ 

The molecular orbitals here used are the same as those in (a) of § 2 but it seems 


convenient to repeat them below : 
Po=Ny (Sat 5) 3 ys 
FARE ONAN IES) BR eg 

et) exp(— Gi.) 5, = (Z*/z) * exp(—Z7,), 

Sq = (2/7) exp(—Z'r,), = (Z"/m)" exp(—Z'r,). 


The values of Z and Z’ are determined largely through guess work similarly to that in 
(a-3) of the preceding section. Actual calculations were performed at three values of 
the internuclear distance R, 1.0, 1.5 and 2.0 au, and the results are summarized in 


Table 7, together with the values of Z and Z’ adopted. E(R) is the total energy of 


Table 7 
R | ZR | ZR E(R) V(R) 
a | | ~ 0.98947 au 
2.25 | 125 | 4.70584 (26.92 eV) 
LS | : es 0.36750 au 
3.0 2.0 | 5.32781 (10.0 eV) 
2.0 3.5 3.0 | — 5.58131 ve ey 


the He-He system and V(R) is the repulsive potential energy defined by E(R) —E(~). 
For convenience, we took E(0o) = —5.69531 au in accordance with Grifing and Wehner" 
and Sakamoto and Ishiguro." The result of the present calculation may be expressed 


conveniently in the following approximate formula : 
V (R) =8.58817 exp(—2.16097R) au. 10 RS2 (R in au). (27) 


We have prepared Table 8 for the purpose of comparing the result of the present calcu- 
lation with those of other calculations and the experimental data, where the figures listed 
in the column (4) are calculated from the above approximate formula (27). The column 
(3) indicates the result of Sakamoto and Ishiguro which is not significantly different 
from that of Griffing and Wehner under the restriction of Z=2Z’, listed in the column (2). 
We see that the improvements achieved by dropping the severe restriction of equal effec- 
tive charge Z=Z’ are rather remarkable in this case. Especially striking is the fact that 
the elaborated calculation of Rosen’ yields very exaggerated values of the repulsive energy 
at small internuclear distance as shown in the column (1). Rosen started with two 
atomic orbitals of different effective charges but went forward by using the valence bond 
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. 1 > 
method as contrasted with the molecular orbital method adopted here. If Rosen’s calcu- 
lation is numerically reliable, this situation seems to impose a sericus question on the 


validity of the valence bond method for the present and other similar problems. 


Table 8. He-He repulsive potential (in eV) 


R (au) (1) | (2) (3) (4) (5) 
0.8889 72.96 51.84 49.94 34.23 10.96 
1.0 56.24 | —_ 38.41 26.92 9.011 
1.185 36.53 24.86 24.78 18.02 6.647 
ip 35.30 —_ 23.94 17.48 6.502 
1.4 22.16 — 14.92 11.34 4.934 
1.6 13.91 —_— 9.294 7.363 3.885 
1.778 9.261 6.212 6.115 5.013 3.216 
1.8 8.730 _— 5.792 4.778 3.147 
2.0 5.481 — 3.609 3.102 2.606 


(1) Rosen!) 


(2) Griffing and Wehner!) | 


' (3) Sakamoto and Ishiguro’ \ according to Table V of Sakamoto and Ishiguro) 


(5) Experimental result of 
Amdur and Harkness!) / 
(4) Present calculation (I) 


(II) Inclusion of polarization along the internuclear axis. 

The primary purpose of the preceding calculation (I) was to diminish the large 
descrepancy between theory and experiment at small internuclear distance. This is the 
reason why we performed the calculation only over the range of 1X RX 2 au. How- 
ever, the force range which is physically most important and of direct interest may 
probably be the van der Waals region and the neighboring inside. We shall report here 
an attempt of consistent quantum mechanical treatment of the interaction between two 
normal helium atoms from the repulsive region to the attractive van der Waals region. 
We use the following molecular orbitals : 


%,=N,y| (Sat 5) +4, ( Pat po) > Fos (28) 
Pu=Nal (Sa—5) +A. ( Pa po) | 3 Tuy (29) 
where 


Sa= (150) a= (Z°/z)"™ exp(—Zr,), \ 
y= (so) ,= (Z3/z)'” exp(— Zr), | 
Pa= (2po) a= (Z°/7)' 1, cos 0, exp(—Zra), 
Po= (2p) = (Z*/z)'" 1, cos, exp(—Zry). / 


4, and A, are the variable parameters and N, and N,, are the normalization factors which 
depend on 4, and /, respectively. To simplify the integrals, a fixed value of Z=27/16 


(30) 
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was used commonly for 1lso and 2po 
orbitals throughout the whole range of the 
internuclear separation. The above scheme 
of functions is essentially identical with 
that used by Griffing and Wehner™ in 
their attempt to calculate the van der 
Waals attraction. The wave function of 
the system is given by (21) and the 
calculation goes just the same way as in 
(I). The dependence of E(R) on /, and 


4,, at each distance has bzcn examined, as 


a rule, at intervals of 0.5 in significant Fig. 1 

figures. The results of the calculation are 

summarized in Table 9. It is observed that /, is positive and /, is negative consistently 
for the whole range of the internuclear distance. This implies that the orbital ¢, has a 


tendency to concentrate in the inner region and the orbital @, wants to extend itself to 


Table 9 
R | Potential hy he 

(au) (au) (eV) 

1.185 0.79581 21.65 0.010 01225 

1.778 | 0.20957 5.702 0.015 —0.085 

2.370 0.053089 1.445 0.010 —0.040 

2.963 0.012318 | 0.3352 0.010 — 0.0245 

3.556 0.002507 | 0.06821 0.0085 —0.0150 

4.148 0.0004012 0.01092 0.0060 —0.0075 

4.741 0.00003644 0.000992 0.0040 —0.0045 
*5_877 | —0,0000214 | —0.0005823 0.0014 —0.0014 


* The result of this last line is due to Grifing and Wehner.}° 


the outer region. For four larger values of R in Table 9, 3.556, 4.148, 4.741 and 
5.877, the well-known Slater-Kirkwood potential formula gives 2.314 107° au, 4.222 
X1074 au, 3.466X10~° au and —2,.696%10~° au, respectively, as the potential energy 
values. On the other hand, the repulsive energies of 21.65 eV at R=1.185 au and 
5.702 eV at R=1.778 au are favorably compared with the corresponding values of 
Grifing and Wehner (4,=/,=0), 24.86 eV and ‘6.212 eV respectively, where they 
failed to improve the values by varying the effective charge parameter Z. It is instruc- 
tive to note that the same kind of deformations of the molecular orbitals in the single 
configuration approximation can cover rather effectively the whole range of interaction 
between two normal helium atoms, from the repulsive region to the van der Waals 


region, with tolerable accuracy. 
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§4. Ethyleae molecule 


The ASMO LCAO methcd in its usual form utterly fails to give properly the 
value of the singlet-triplet separation of the excited B,,, states of the ethylene molecule 
On the other hand, in treating theoretically this molecule by the method of atoms in 
molecules, Moffitt!” succeeded to obtain remarkably good agreement between theory and 
expetiment. This success seems to be one of the cornerstones of his method and his 
philosophy. 

We shall treat the ethylene molecule in its stable form only, that is to say, assum- 
ing it to be planar. In the 7-electron approximation our subject is essentially a two- 
electron problem. If we prepare one bonding molecular orbital 0 and one anti-bonding 
molecular orbital ¢* for the two electrons, the wave function of the three lower states 


are given as follows in the single configuration approximation : 


P yCA,) =0(1)6(2), (31) 
P , ?B,,) = (1/v 2)[d(1) 6* (2) —d* (1) 6(2) J, (32) 
Py CB,,) = (1/v 2)[d (1) d* (2) +4* (1) 46(2) J, (33) 


where ¢ and ¢* are assumed to be normalized to unity. The explicit forms of 6 and 


é* in the conventional LCAO MO approximation are 
G=N[(2p7) a+ (2p7),],  O*=N*[ (2p=).— (2p7),], (34) 


where (2p), and (2p7), are 2p7 atomic orbitals at the carbon nuclei @ and 4, respec- 
tively. The atomic orbitals contain the effective charge parameter, say Z. Usually the 
value of Z is fixed ab initio and the same value is used commonly for Vy, VY, and Fy. 
Roughly speaking, this stage of approximation corresponds to the case (a-1) of § 2. 
Moser” and others carried out the calculations in this approximation. An interesting 
calculation corresponding to the case (a-2) of §2 has been performed by Murai.”? 
The results of these calculations are found in Table 11 and we see the results are not 
encouraging, especially revealing the notorious defect of exaggerating grossly the singlet- 
triplet separation of the B,, states. It is highly desirable to perform the calculation 
which corresponds to the case (a-3) (Z#Z’) of $2 by using the orbitals ; 


G=N[(2p7).+ (2p%),], g*=N*[ (2pz)q’— (2p=),’], (35) 


wheré primes in §* indicate that we use different effective charges for 6 and ¢*. How- 
ever, if we try to make such a calculation non-empirically, we have to devote a large 
amount of labor to the numerical evaluation of various molecular integrals. The same 
is true if we want to make a calculation similar to the case (b) of § 2 on the ethylene 
molecule. Let us remind, however, that the one-center expansion method has been applied 
to the ‘*'{ state of H, even more successfully than the LCAO MO method with ZAZ, 
as remarked in the last paragraph of § 2. Then it is natural to try to apply the one- 


center method to the ethylene molecule as a substitute for more orthodox calculations to 
be performed in the future. 
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The simplest choice of the one-center molecular orbitals for the present problem 
would be to take 


O= (2p7) =R,(7) Su (9, 9), > (36) 
R, (7) =N (2, €)r exp(—€r), 
Sa (G,- 9) = (1/2) "[¥1,-1, 9) + (—1) Yi, 9) J, 
¥i2:(6, ¢) = #(3/87)'” sind exp(+i¢), 
$* = (3d7) =R,(r)5.,(0, 9), > d (37) 
Ra(t) =N (3, 6)1° exp(—ar), 
Sx (9, g) = (1/2)? [Y2,1(6, 9) + (1) Ya, 8, 9) ], 
Yo,41(6, ) = F (15/87)'” sind cos exp ( ig). 
Here the reference center of the electron coordinates is located at the middle point between » 
two carbon nuclei. (See Fig. 2) € and 0 are the variable exponent parameters and 
N(2, €) and N(3, 0) are the normalization factors for the respective radial functions. 


By using simplified notations, p and d, indicated above, the wave functions of the three 


states in question can be written 


P y(A,) =p(1) p(2), (38) 
PF » By) = (1/V 2)[ p(1)d(2) —d(1) p(2) J, (39) 
Py 1By,) = (1/2) p(1)d(2) +4(1) p(2) ]. (40) 


The Hamiltonian for the system may be written 
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H=T (1) +7 (2) +H,(1) +H,(2) + G/n) (41) 


where T'(v) is the kinetic energy of electron », H,(») is the mutual potential energy 
of electron » and the single-bonded H,C-CH, framework, and 1/r,. is the repulsive 
potential between electron 1 and 2 (in the atomic units). Various expressions have been 
used for H,(¥) and we shall adopt tentatively the simplest of them after Labhart,~ 
namely, 


A, (v) =a —Z(1/Ta,+ i/tn) > (42) 


where 7,, and 7,, are the distances of electron » from the carbon nuclei 4 and 6, respec- 
tively. As for the values of the parameter Z, Labhart assumed two values, 3,25 and 
2.04. Correspondingly we adopt here two values of Z, 3 and 2, and see what happens 
in the relative positions of the three energy levels in question. 

In the present one-center method the molecular orbitals, 6 and ¢*, are both centered 
on the middle point between two carbon nuclei and consequently the calculation offers 
no troubles. For the sake of convenience, the carbon-carbon distance is assumed to be 
QoL au (41.323 A). The parameters, £ and 0, are determined in the following way. 
First we consider the case where one electron occupies the orbital 6=(2p7) and C is 
chosen so that the energy of the system, (2p7|T+H, 2p), is minimized. Actual 


minimization process was performed only at intervals of 0.1 and the results are 


(<=2). C=1L.ly and. (Zea CT: 


> 


These values are used for the p orbital in Y,(°B,.) and ¥,('B,). Next, as for 
Y y('A,) =p(1) p(2), we calculate the energy of this two-electron system and the values 
of € are determined to be 


(Z=2) (€=10 and (Z=3) C=1.3. 
Corresponding values of the energy Ey('A,) are 
(Z=2) Ey('A,) =—2.00878 au and (Z=3) E,('A,) = — 3.89346 au. 


Finally we calculate E,(*B,,.) and E,('B,,.) by varying the parameter of the 3d7 orbital 
and the results are shown in Table 10. 


Table 10 

Le & ' 3 Er @Byu) | Ey (Bx) 
0.9 —1.75293 . — 1.68484 
2 11 1.0 —1,79119 / — 1.70796 
Ll — 1.79943 —1.70220 

12 —1.79197 / — 1.68233 
13 — 3.59534 — 3.48544 
3 4 1.4 —3.62030 — 3.49655 
18>) — 3.62871 — 3.49237 
— 3.61994 — 3.47240 


On the Orbital Approach to the Many-Electron Problem in Molecule 29 


We pick up the best combinations of E,(°B,,,) and E;('B,,) from Table 10 as follows ; 
E,=—1. = j= 
peer (Er=—1.79943 a (C=1.41, 0=1.1), 
|E,=—1.70796 ats a(G— ale 0-110). 
(Z=3) (Ex Zc62871Vaun) MCD. Aes Ss)} 
Rear ote Reig Bly’ p 
Then we can now calculate the relative positions of the three energy levels of the system, 
"By, °B,, and 'A,, referring to the value of E y('A,) given before. The results are shown 
in Table 11, together with those of various other calculations and observed data. As is 
seen from Table 10 and 11, in spite of the drastic changes in absolute values of energy 
between the cases of Z=2 and Z=3, the telative position of the energy levels suffers 
comparatively small changes. The situation does not alter if we put a certain amount 
of negative charge on the middle point between two carbon nuclei. This modification 
seems to be of some use for obtaining reasonable value for the ionization energy of the 
molecule but too much elaboration on this point does not seem to be worthy. After 
all, if our simplified assumption (42) for H,(¥) is acceptable, at least provisionally, the 
results exhibited in Table 11 may be found rather interesting. The result of the present 
simple calculation seems to indicate good possibility of the successful orbital approach to 
the present subject, in spite of recent critiques of Moffitt and others. 


Table 11. Electronic states of ethylene* (in eV) 


This paper 
State ASMOz«z Murai’ | Ohno-Itohe | Moffitt Obs.@ 
Z=2 Z=3 
V CB,,,) 11.4 11.2 77 7.3 7.6 8.2 10.8 
T (By) 2.8 4.6 5.4 5.5 6.4) $2 72 
N(CA,) 0.0 0.0 0.0 0.0 0.0 0.0 0.0 
V—T 8.6 6.6 Zoe 1.8 eZ, 2p oe 3.6 


See reference 20. 

See reference 21. 

K. Ohno and T. Itoh, J. Chem. Phys. 23 (1955), 1468. 

See reference 19. } 

The internuclear distance of 1.353 A is used in a, 6, c and d, while in the present calculation the 


Palos of 1323 Ais adopted. 


ro RTS doy asst 


§ 5. Discussions 


(a) 
It has been stated in (a-3) of § 2 that the leading motif of the present work is 
to give the freedoms of separate deformations to the bonding and anti-bonding molecular 
orbitals. The importance of such a procedure was first noted for the hydrogen molecular 


ion by Coulson® in 1937. Roughly speaking, the bonding orbital tends to contract 
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inwardly and, on the contrary, the anti-bonding orbital wants to extend itself outwardly, 
these tendencies being understood most easily from the united-atom point of view. 

In the present calculations we have used several kinds of device to give such freedoms 
to the molecular orbitals. The first of them is to assign distinct effective charges to 
the bonding and anti-bonding orbitals in the LCAO MO method. This device is quite 
successful in the calculation of the repulsive potential between two normal helium atoms, 
as described in (I) of § 3, and gives satisfactory result concerning the “+, state of the 
hydrogen molecule, as reported in (a-3) of § 2. It fails, however, to estimate properly 
the energy of the '2} state, giving too large value for the singlet-triplet separation of 
the 3'* states. Fortunately we have succeeded to overcome this difficulty by employing 
another kind of molecular orbital called James-type tentatively in (b) of $2. Let us 


repeat the expressions here, for the sake of convenience : 
G,~ (1+¢77) exp(— a) » Pu» exp ( — =), 
E=(ra+r)/R, 7=(ta—M) /R- (43) 


Now let us examine the bearing of the 
parameters a, # and c a little more closely ¥9 
than in § 2. In Table 6 we notice that the 


3 y+ 


7 


values of c are remarkably different for 
and “2,7. At R=2) form example’? = a b 
0.6111767 for ?X* and c=0.1396027 for 
Poe inthe «case (ofl. Ge 225 715). 
What is meant by this large difference ? 

To answer this question, let us digress 
briefly to recall that it is usually told in such 
a way that in the single configuration ap- - : 
proximation of the ASMO method the wave a 
function which consists essentially of the 
products of one-electron orbitals gives the Fig. 3 
correlation between electrons with parallel . 
spins because of the antisymmetry requirement but cannot include properly the correlation 


effect between electrons with opposite spins. Probably this is the reason why we can be 


success axe PN i 
ful for the °S'; state but not for the ‘Y* state of the hydrogen molecule in the 


LCAO MO method with two parameters of Z and Z’. Overlapping of the two electron 


orbitals hee i i i 
would be too large for the 'S; state with two electrons of opposite spins, while 


it is moderated appropriately by the antisymmetry of the wave function for the “2'* state. 
We shall now return to our James-type orbitals. The parameters @ and § Elbcnttdl 
to Z and Z’ of the LCAO MO approximation but here we have an additional para- 
meter c, by which we can adjust further the degree of spatial overlapping between the 
two electron orbitals. As is seen from the expression of ~, in (43), small c means that 
the electron distribution tends to accumulate in the space between the two nuclei and, 


on the contrary, large value of ¢ implies that the electron concentrates at the positions 
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. 1 > EX . 
of the nuclei. For the "2 state we have small value of c in order to reduce the etror 


Re 
ever, the wave function Po (11) Pu (te) — Puls) Gy (tT) vanishes automatically when 1,=7, 


and then we are able to increase the value of c, letting the electron feel the nuclear 


due to the interaction between electrons with ‘Opposite spins. For the *3'' state, how- 


attraction more effectively without worrying too much about the electron-electron correla- 
tion. (Actually, the value of c is determined by the Ritz method.) As is noted in 
(b) of § 2, however, c=0.1396027 for ‘3S’ in the case of (2a=2.5, 261 75, eR=2) 
seems to be too small, probably reflecting the fact that the choice of a and f is not 
proper and the parameter c is forced to be too small to separate y, and ¢,, spatially, in 
order to compensate the loss due to the wrong choice of @ and 9. Indeed, if we contract 
¢, a little more by taking 2a=3 and extend ©, by taking 28=1, instead of 2a=2.5 
and 23=1.5, we obtain the improved value of E, (13) = —1.21097 au with c=0.2788684 
at R=2, this value being considerably larger than the former value of 0.1396027. 

We may say that the deformation afforded by the parameters (Z, Z’) or (a, ) 
is essentially of one-electronic in nature. The well-known result on the one-electron system 
of H; supports this point of view. The demand for the flexibility owing to. the para- 
meter c may be called as partly one-electronic and partly two-electronic in view of the 
above considerations. Anyhow, it seems to be reasonable to regard these deformations 
as of inter-atomic nature rather than of intra-atomic nature, according to the terminology 
of Moffitt. 

(b) 

Making an analysis of certain observed and calculated features of the oxygen mole- 
cule, Moffitt’? concluded that it is mainly in its inability correctly describing the term 
values of atomic (stationary and valence) states that the ASMO method breaks down. 
As a result of this analysis, a different method, the method of atoms in molecules, was 
put forward by Moffitt.” The basis of this new method is to separate the problems of 
atomic from those of molecular structure. Mofhtt’s argument is ingenious and quite 
successful in some cases but it seems to be neither definitive nor unambiguous. He starts 
with the analysis of the asymptotic behavior of the wave function of molecular system 
in the limit of infinite separation. In view of the results of the present paper, however, 
it is clear that we are not always justified in drawing definite criteria for ‘“‘ molecular’ 
orbitals from such a procedure. For example, the wave functions of the lowest 3S t and 
1Y’* state of H, consisting of the James-type orbitals are rather good at the internuclear 
separation of actual molecular configuration but they are bound to behave very incorrectly 
at the infinite separation, as is readily seen in (43). More simple and impressive 
examples are found in the successful applications of the one-center expansion method. to 
molecular systems. Recently the present author” calculated the repulsive potential between 


two normal helium atoms at small distance by the united-atom approximation. The 


molecular orbitals used are 
Co (1s) =N(1, C) exp(—¢r) yas 
Q.= pc) =N(2,.9)r exp(—77) Yio. 
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The origin of electron coordinates 
is located at the middle point a 
between two nuclei as pictured 
in Fig. 4. Even at the distance 
of R==1.0 au these seemingly 
very crude molecular orbitals work " b 

definitely better than the elabo- 

rate valence bond wave function 

of Rosen.” It is to be noted Fig. 4 

that Rosen’s wave function is 

excellent at infinite separation, while the united-atom wave function breaks down completely 
there. In addition, even at small separation one-center wave function describes only very 


’ region around nuclei as is seen from Fig. 4, while 


“ atomic’ 


crudely the electrons in the 
Rosen’s function is accurate in the “atomic” region. The unexpected success of the 
one-center molecular orbitals strongly ‘suggests that in calculating the repulsive energy 
between two helium atoms at small distance the deformations of the molecular orbitals 
over the whole molecular dimension are essentially important. Matters are quite similar 
for the treatment” of the lower excited states of the hydrogen molecule by the one- 


center expansion method. For the ‘2’; state at R=2 we have used the wave function, 


P CST) =(1/V 2) [¢, (1) Gu. (2) +¢.(1) &, (2) ], 
where 
$,=0.6614319| (Iso ; 1) +0,51260(4se ; 3) +0.2707095 (4do ; 3)], 
(nso 3 €) =N(n, ()r"“' exp(—Cr) Yo, 
(ndo ; €)=N(n, €)r"“" exp(—Lr) Yu, 
N(n, €) = (22)""1"[ (2a) 17", 
Gu= (2pe ; y) =N(2, y)rexp(—7r) Yj. 


Again, the center of the electron coordinates is located at the middle point between two 
nuclei. This function yields the value of E,('Y~) =—1.205872 (y=0.5) as the elec- 
tronic energy of the ‘3’; state at R=2, as remarked in the last part of (b) of § 2. 
This result is favorably compared with the results of the LCAO MO approximation, for 
example, —1.08121 in Table 3 (=2'’#1) and —1.11140 in Table 5 (Z4Z’). 
Thus we have been able to obtain rather good energy value of the ‘2’; state by the 
one-center expansion method, in spite of the apparent crudeness of the wave function not 
only in the large internuclear suparation but also in the vicinity of nuclei at small distance. 
The reason for this success is essentially the same as in the calculation based upon the 
James-type molecular orbitals: if we give appropriate flexibilities to molecular orbitals and 
utilize the whole space around the molecular system to reduce the error due to the 
electron correlation, we can obtain rather good results even within the single configuration 


approximation. Similar situation may naturally be expected in the case of the *B,,, and 
at : 7 : 
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'B,,, states of the ethylene molecule. A preliminary calculation reported in § 3 seems to 
confirm this expectation. The same would be expected also for the *3'} and *2'> states 
of the oxygen molecule. 

(c) 

The failure of the conventional ASMO LCAO approximation does not necessarily 
mean the break-down of the useful concept of ‘molecular orbital” itself. The principal 
aim of the present paper is to testify this by presenting several examples. 

Needless to say, it is absurd to stick to the orbital approach and remain within the 
single configuration approximation spending too much labor, if we are supplied with new 
efficient and reliable calculation schemes by giving up the simple orbital description. The 
merit of the various semi-empirical methods, such as of Moffitt and of Pariser and Parr,” 
is that they enable us to treat efficiently rather complicated problems with reasonable 
labor. Also in regard to the so-called non-empirical method, if we can improve the 
result of the first approximation sufficiently by including only a few configuration interac- 
tions, it is meaningless to try to stay within the single configuration approximation by 
elaborating the forms of orbitals. 

In this connection, however, it is to be noted that the new method of Moffitt or 
something like that is very useful in some cases but, after all, never constitutes a rigorous 
_ theory by which we are guaranteed to reach the final truth from the theoretical side. 
On the other hand, nothing prevents us, at least in principle, to reach the final goal by 
using the conventional ASMO method with configuration interactions. But, actually, 
here arises the serious difficulty of poor convergency in the procedure of configuration 
superposition. A typical and very instructive example for this is found in the work of 
Taylor and Parr” on the ground state energy of the helium atom. We learn from 
their results that it is very difficult to compensate the error due to the wrong choice of 
the starting single configuration by the procedure of configuration superposition. We 
see that if we start from the single configuration (15) (E=—2.8477 au), we reach 
E=—2.8719 au at best by superposing four configurations, (15)*, (2p)°, (3d)* and 
(4f)°, while single open shell configuration (1s1s’) yields E=—2.8757 au and with 
four configurations, (1s1s’), (2p)*, (3d)* and (4f)°, we can reach an excellent value 
of E=—2.8974 au, experimental value being E=—2.9035 au. Murai® has noticed 
that matters would be quite similar also for molecular calculations. If it is the case, 
effort to look for the good wave function in the single configuration, such as described 
in the present paper, will become necessary for future calculations aiming at rapid conver- 


gency and high accuracy. 
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In this paper we study what kind of mathematical framework is adequate to the future quantum 
theory of fields, which is physically correct and mathematically rigorous. From the physical point of 
view, we assume that the field quantities form a *-algebra and that the states are linear functionals 
on it. Field quantities must have many physical properties, and so the *-algebra must have corres- 
ponding properties, which we shall describe in a mathematically precise manner. From these properties 
we show that this *-algebra can be represented by an operator algebra in a separable Hilbert space, 
and that states are closely related to vectors of this space. 


§ 1. Introduction 


In quantum mechanics it is assumed that the set of all state vectors of any physical 
system forms a separable Hilbert space, and that any physical quatity can be represented by 
an operator in this space. In order to study quantum mechanics with mathematical 
rigour, these are the most essential assumptions, on which quantum mechanics is system- 
atized to be a complete physical theory. In general, the above assumptions are accepted 
also in the quantum theory of fields, yet it is open to question whether they are quite 
adequate or not. For example, if any interaction of two fields causes creation and an- 
nihilation of particles, the Hilbert space cannot be separable because the system has an 
infinite number of degrees of freedom’. Further, though field operators ¢/(x), etc., play 
very essential roles in the usual field theory, the mathematical contents and roles of these 
operators are not yet clarified, and they are customarily considered to be merely symbolic 
[2), p. 13]. Some authors assert, therefore, that the above assumptions are not adequate 
and that this is the reason why attempts to build a rigorous mathematical basis for the 
gantum theory of fields within the framework of a Hilbert space always end in unsatis- 
factory results’. This assertion may be correct to a certain extent, but even if it is the 
case, the problem, what kind of mathematical framework is adequate, has hardly been 
studied yet. To solve this problem is the main purpose of this paper. 

Though the renormalization procedure is still mathematically incomprehensible, the 
usual quantum theory of fields has succeeded in some respects by means of this procedure. 
Therefore we may say that, in future, we shall be able to build a satisfactory quantum 
theory of fields which is physically correct and mathematically rigorous, and that quantum 
mechanics and the usual quantum theory of fields show some signs of correctness which 


will survive in the expected future theory. Now we must study what parts of the usual 
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quantum theory of fields will remain valid in the future theory. For this purpose, we 
put a set of assumptions each of which seems to us to be very natural from the physical 
point of view, especially from the standpoint of the usual quantum theory of fields, and 
we study to what conclusions these assumptions lead. We treat fundamental but rather 
formal problems in this paper, and characteristic problems of the field theory, which will 
be connected with the divergence difficulties, will be treated in the next paper. 

In quantum mechanics we may consider two representations, the Schrodinger and 
the Heisenberg representations, but they are mathematically equivalent and are transformed 
into each other by some unitary transformations. We do not know, however, to what 
extent holds the correspondence between quantum mechanics and the quantum theory of 
fields, and so it is necessary to decide in which representation we must start our consider- 
ations. This problem will be equivalent to the following : which are more essential, 
physical quantities or states? In quantum mechanics it seems to us that state vectors 
are more essential than physical quantities, but, in the usual quantum theory of fields, 
especially in view of the method of calculations of this theory, we may say that physical 
quantities play more essential roles than states. In fact, the kernel of the method of 
second quantization which connects the two theories, is to regard state vectors in quantum 
mechanics as operators in the quantum theory of fields. Thus, we may say that in 
quantum mechanics states play an essential role, while in the quantum theory of fields 
physical quantities are more essential. Therefore, we believe that in the quantum theory 
of fields physical quantities and algebraic operations for these quantities are most essential 
and that they will remain valid in the future theory. But, as we have stated in the 
beginning of this section, it will be inadequate to accept field operators themselves as 
physical quantities and, at the same time, to accept usual algebraic operations for them. 
We can, however, construct an operator ¢, from field operators ¢/(x), as follows: Let 
7(x) be any appropriate function and put ¢,= j¢/(x)7 (x) dx, then this ¢/, can be considered 
as something like an average of ¢/(x) and we can treat it more consistently than ¢/(x) 
[2), p. 15]. In this paper, therefore, we treat only these consistent operators and we 
call them field quantities in contrast to field operators ¢/(x), etc. 

To accept the usual algebraic operations for field quantities will be equivalent to 
assume that the set of field quantities forms a *-algebra Y['. In any state, we shall be 
able to measure any field quantity and to get a definite mean value, namely an expecta- 
tion value, and hence we assume that a state is a linear functional on the set of field 
quantities. Thus the objects of our study are the *-algebra and linear functionals on it. 
From physical requirements field quantities must have many properties, which we shall 
describe in a precise way in terms of the mathematical conditions to be satisfied by the 


“algebra 2. Of course, we cannot say that the correspondence of the physical properties 


' In quantum mechanics physical quantities are often assumed to be self-adjoint, and then the product 
of any two physical quantities is not necessarily admitted, but, in quantum theory of fields, operators which 
are not self-adjoint play an essential role (creation and annihilation operators, etc.). We < not, therefore. 
assume in this paper that all field quantities are self-adjoint, and so we can assume renal nel th o 
product of any two is admissible, . cM 
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of field quantities to the mathematical forms of assumptions is determined uniquely. 
But we believe that these assumptions are very “suitable for the corresponding physical 
requirements. Furthermore, it will be almost inevitable from the standpoint of present- 
day physics to put these assumptions or mathematically equivalent ones, and if we do 
not put these assumptions, we shall be in a very difficult situation (not only mathema- 
tically but physically). On these assumptions we shall show that Y{ can be represented 
by an operator algebra in a separable Hilbert space, and that, on a set of commutative 
self-adjoint quantities, any state can be represented by a vector of the Hilbert space. 
From the standpoint of mathematics, the main problem in this paper is that of 
representations of *-algebras. There are many studies concerning representations of the 
Banach algebras and their relations to quantum mechanics”, but the fundamental notion 
in these theories, namely that of a norm, seems to us to be physically unacceptable. 
Indeed, in quantum mechanics and in quantum theory of fields, many unbounded operators 
play a most important role. To the author’s opinion, the most fundamental property of 
field quantities is shown in the fact that we can give an expectation value to any quantity 
in any state, and so we use this property exclusively. In this respect, we may say that 


we use Semi-norms instead of norms in the Banach algebra. 


§2. Field quaatities and states 


Let Y{ be the set of field quantities of a given physical system, and a, 6,--- be its 
elements. Let § and SR be the complex and real number fields respectively. As stated 
in § 1, we assume that the usual algebraic operations for field quantities are adoptable 
in our case. Namely, we assume that, if we and a, bel, there are defined a+6, ab 
and aa in YY, and YY is an algebra over SY [4), p. 11]. In quantum mechanics there 
is an identity operator I in a Hilbert space, and if we want to regard any c-number a 
as a physical quantity, we must replace it with an operator al. Therefore we assume 
that there is a unit element e in Y{. In the usual theory every field quantity has an 
adjoint, and this fact has a very essential meaning. For example an adjoint of a creation 
operator is an annihilation operator. Thus we assume the existence of an adjoint a* of 
any element a in Y{ [4), p. 499]. Summing up these assumptions we put 

Assumption 1. is a *-algebra over R with a unit element e. 

Here a *-algebra is an algebra in which every element a has an adjoint a* subject 
to the conditions : 

Gary (a*) Resa, 

(ii) (a+6)*=a*+6b*, 

(i) aah) * = b* a", 

(iv) (a@a)*=aa*(aeR and @& is the conjugate complex number to a). 

An element aé2{ is said self-adjoint if a=a*, and it is clear that the set Yl, of all 
self-adjoint elements of { is a vector space over Rt, and that every element ae€Y{ is 
uniquely expressed as the sum a=a,+ ia), where both a, and a, are self-adjoint. 

Let f, g,:++ be states of the physical system under consideration, and ©, be the set 
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. ~ 
of these states. -We assume that measuring any field quantity aeY,, in any state fESo 


we can get a definite mean value called expectation yalue, and can denote it by f(a). 


Further, we assume that 
f(aa+ Pb) =af (a) +f (6), a, PER, 


namely the state f defines a linear functional on Y,,, and that if we know the value 
f(a) for all aeQf,,, the state f is completely defined. So we put 

Assumption 2. Any state f is a linear functional on A... and satisfies the following 

conditions : 

Gotef ©) =. 

Cain f(z) zz0 sfor call. ae, 

(iii) if a*z0, there is a state f in which f(a) 0. 

As a= (1/2) (e+a)?— (1/2) (e +a’), it is clear that f(a) is real for all ae... 
The condition (i) assures the normalization of states, and the condition (ii) may be 
interpreted as follows: if ae€Y{,,, a° is non-negative in a sense |see § 3). From the con- 
dition (iii), we find that when a*<6, there is a state f in which f(a)+f(6), and 
therefore we can distinguish any field quantity from another by their expectation values 
in some appropriate state. 

For any linear functional on Y{,, we can show 

Lemma 2.1. If f2<0 is a linear functional on Y., and f(@) = 0 for all aeX.,, then 
fle) 0. 

Proof. As f=<0, there is an element aé€Y{,, for which f(a)=0. Let f(e) =0, 
then we have for any we} 


OX f ([@e—a]?) = f(a*e— 2aa+ a’) = f(a) —2af(a), 
from which follows f(a) =0, and we have a contradiction. Q.E.D. 
Putting (f+g) (4) =f(a)+9(a) and (af) (a) =af(a) for any fe J€&S_ and aeR, 


we can construct a real linear space 
es n 
ee es ins , = 
iis fs f=>e; vs 
t=1 


Let fe}, and f(a?) >0 for all ae... then f(e)*=0 by Lemma 2.1, and putting 
f’=(1/fle))f, we can define a linear functional f’€%, which satisfies the conditions (i) 
and (ii) in Assumption 2. Let © be the set of all these linear functionals. It is clear 
that ©,CG, and we call every element feS also a state. In quantum mechanics, we can 
define that, for any vector 4, f(a) = (¢', ay) / (¢/, g') is a state in our sense. Let the 
set of these states be ©). Then constructing © by the above method, we can easily 
show that G@)© in general. By using © or Gy, there will be no serious differences 
and so we use © instead of ©, only for brevity’s sake. : 

Since every a€¥{ is expressed as the sum a=d,+idy, d,, ay€Y[,,, we can extend any 
fe, on Woq to a linear functional on YA by the equation f(a) =f(a,) +if(a). Let ¥ 
be a complex linear space which is generated by these linear functionals on ‘WU. If A 


identify fe}, with the corresponding fe}, it is clear that Vr CBs 
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We shall now proceed to the duality of 9{ and 9%. Let. A and B be two linear 
space over a number field K. We say that A and B are dual to each other, if there is 
a bilinear mapping (a, 6) >T (a, 6) €K, a€A, b€B, which satisfes the following conditions : 
(1) for any 450 in A, there is an element 6 in B, for which Lid ep) ex 0} 

Gi) for any 60 in B, there is an element a in A, for which T (a, 6) *=0 [5), Chap. 
IV, p. 48]. 

Let A and B be dual to each other, then a topolcgy in A is said compatible with 
this duality, when, with respect to this topology, 4 is a locally convex Hausdorff space 
and B is the set of all continuous linear functionals on A [5), Chap: IV, p: £67]. 
Especially, the weakest of these topologies is called the weak topology and is denoted by 
o(A, B). Put 

Wop, 25) ens ©) — ac 0, (4) 6) t= te 2, ent 
for every €>0O and every finite set of 6,¢B (i=1, 2,:--,n). These W(b,, €) form a 
fundamental system of neighbourhoods of o¢A in o (A, B) [5), Chap. IV, p. 50]. 

It is clear that Y,, and %,.(2{ and %) are dual to each other, and so it is natural 
to use a topology which is compatible with this duality. In particular, let us consider 
the weak topology 7(Y.., 7). We can easily show that fundamental neighbourhoods of 
a€2U,, ate sets of elements such that |f;(42) —fi(a)|<& f,¢G,, i=1, 2,---, m. From the 
physical point of view, on the other hand, we can roughly say that two field quantities 
which take almost equal values in a finite number of states are near in some sense, 
Bamely wil J 44.) — f,(¢>), 1—1,02,<-, a then, dy—2,. ~~ | hese. two <concepts,wi.e., the 
fundamental neighbourhoods in o (Y,,, ?5,.) and the nearness from the physical point of 
view, are in accordance with each other. Therefore it will be natural to take the topology 
T (Usa Wr) for Asa, and similarly o (QA, ) for W. 

Of course, a+6 and aa are continuous functions of (a, 6) and (a, a) respectively. 
Furthermore 

Lemma 2.2. The mapping a—a* from YX to Y is continuous with respect to o (YA, a) > 
and M,, is a closed subset of MX. The topology which is induced on U,, by o (2M, %) is 
equivalent to 7 (Meas Wy) + 

Proof. %% is generated by %,, and if fej§,, then f(a*) =f(a). Thus the mapping 
a—>a* is continuous in o (2, 9). Let J be a directed set and lim a,=a(ceJ), then 
lim a;*=a*, and as Y is a Hausdorff space a,=a/ (ef) imply a=a*. This shows that 
Y., is a closed subset of Yf. The remainder of this lemma is obvious. QO: Et: 

It is natural to assume that any product ab of two elements of Y{ is continuous in 
some sense, and since the requirement of the continuity with respect to both quantities 
seems too restrictive, we shall assume that ab is separately continuous in some topology. 
If a mapping from Y{ to Yf is continuous with respect to any topology which is compatible 
with the duality of Y{ and %, it is continuous also with respect to o (2, %) [5), Chap. 
IV, p. 103]. Therefore, the following is the weakest of similar assumptions, and this 
fact may be considered as another advantage of taking the topology o (YU, w)- 

Assumption 3. The mapping a—>ab with fixed b is continuous with respect to 7 (2, %) 
for all ber. 
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Lemma 2.3. The mapping b—>ab with fixed a is continuous with respect to o (2, *) 


for all aeX, namely ab is separately continuous. 
Proof. Let T, be a mapping b6—>6*, and T, be b*->b*a*. As these mappings are 
continuous, the mapping (T,-T,-T,)b=4b is continuous. Q.E.D. 
Thus we may say that the topology o (2, %) is compatible with the algebraic 


structure of YI. 


§ 3. Real commutative subalgebra 


The fundamental postulate in § 2 is that every self-adjoint field quantity has a 
definite mean value in each state, but an order relation in Y.,, in other words a comparison 
of any two field quantities, will be a more fundamental problem. For example, let us 
consider the measurement of the length L of a rod, and let the length of a standard 
rod be L,. Then the measurement is reduced to the comparisons between L and nlL,, 
n=1, 2,:--. In general, the measurement of any physical quantity can be reduced to 
the comparisons of this quantity with standard quantities which we may call scales. As 
an idealized scale must have a constant value in any state, it will have a close connection 
with ae, wef, and so we shall be able to compare any physical quantity a€Y{,, with @e. 
In this section we define the precise mathematical meaning of the comparison, and study 
the order relation in YI... 

Let A be a real vector space in which an order relation > is defined. A is called 
an ordered vector space |6), pp. 1, 238], if 
(t)et adeeb implies at+c>bt4«, 

(ii) @>0 and a>o imply aa>o. 

When we compare any element aé{,, with be... it is natural to say that a is 
greater than or equal to 6 if f(a) f(b) for all feS, and we write it a->6. Then it 
is easy to show that Y,, is an ordered vector space, and that the set 98 of all elements 
ao is a closed convex cone in 9,, with respect to o(Y,,, 9%.) [5), Chap. Il, p. 49]. 
Further let 


n 
ppc te ae eR | 2 +. 
Po= {a3 a=} a; dj, a; — 0, 4; € YL. n=1, 2,°-*}, 


i=l 


then we have 

Lemma 3.1. Let the closure of YS, be Y,, then R=P. 

Proof. As 3 is closed, it is clear that $8,C 38. If 8,298, then there is an ele- 
ment a€4$ which does not belong to 48. Since 4, is closed convex, there is a functional 
fe, such that f(a) < f(b) for all bes, [5), Chap. II, p. 73]. As o€¥8,, f(a) <0. 
On the other hand, be}, implies abe’, for all @>0, and then f(b) >>0 for all bes. 
Therefore (1/f(e))feS and f(a) 0. This is a contradiction. Q.E.D. 

As ab means that f(a) > f(b) for all feS, this order relation seems to be very 
suitable to Y,, from the physical point of view, and so we expect that in some sense 
the order relation is compatible with the structures of Ys and Y{. The structures of YI 
and can be considered from two points of view, namely E 
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i) the algebraic structure, 
ii) the topological structure. 

i) As Y,, is an ordered vector space, it is clear that the order relation is compatible 
with the structure of Y{,, as a vector space [5), Chap. II, p. 48]. Therefore, if a, b=>0 
imply ab<o, the order relation may be said to be compatible with the algebraic struc- 
ture [7), p. 71]. However, we cannot assume this property in Y,,, because a, be%,, 
does not imply abeY{,, in general. A necessary and sufficient condition that abe,, for 
a, beY., is that ab=ba. From these considerations we assume 

Assumption 4. If ab=ba and a, bo, then ab>o. 

Therefore, to study an order relation compatible with an algebraic structure it is 
necessary to take a commutative subalgebra whose elements are all self-adjoint, and we 
call such a subalgebra a real commutative subalgebra. In quantum mechanics such a_sub- 
algebra plays a very essential role. Namely, a maximal real commutative subalgebra is a 
complete set of commuting observables, and in a very rough sense we can represent them 
simultaneously by diagonal matrices, and the simultaneous eigenstates of these observables 
form a complete orthonormal set in the Hilbert space. 

Let 11 be any maximal real commutative subalgebra of Y{. It is clear that m is 
closed and ee€m. Put 


m= ifs feds. f(a) =0, aem}, 
and %m=%,/m’. Let fem be the class to which f belongs, and f(a) =f(a). Then 
f is a linear functional on m, m and %}m are dual to each other, and o(in, m) is 
equivalent to the topology which is induced on m by o(Y.,, 2) [5), Chap. IV, p. 54]. 
Now, 1 is an ordered vector space with respect to the above defined order relation, 


PBRm= PN m is a closed convex cone and Bm={aent; ao}. Put 


Sm={f; feS, fem}. 
In order to study the properties of m we can use Gin instead of ©. 

ii) Next, the order relation is to be compatible with the topological structure of 
in. For that purpose, we define order-convergences as follows. 

Let a,| 0, 2, «€J (directed set) mean that ¢<« implies a, > a,, and that the greatest 
lower bound of {a,} is o, that is A {a,;c€J}=o. If (a—a,) Lo or (4,—a) V0, we 
write a, a or a,|4 respectively, and we say that {a,} order-converges to a [6), p. 40]. 

As fem is continuous in any topology which is compatible with the duality in 
m and Sm, we may say that the order relation is compatible with the topological structure 
of m if and only if a,| 0 imply f(a,)—0. Thus we assume 

Assumption 5. If a, 0, then f(a.) 0 for all. fe%m. 

From this we can easily show that, if 4,4 or a,)4, 4,4 in o(m, *$m). Con- 
versely, if {a,}, a,—0 is a directed set of points such that ¢<« implies 4, a,, and 
a0 in o(m, %m) then a,|0. From the above considerations we may say that the 
order relation in m is compatible with the structure of it. Now we must study mt in detail. 

Let A be an ordered vector space. If for any 4, beA there is a least upper bound 
aUb, we call A a vector lattice, and if every bounded subset has a least upper bound 
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(lu.b.) and a greatest lower bound (g.l.b.), we call it a complete vector lattice [6), p. 
238]. An ordered vector space A is called Archimedean if and only if a= nb, nA, + 
imply b<o [6), p. 225]. 

Lemma 3.2. mi is Archimedean. ; b, 

Proof. If a->nb, n=1, 2,--:, then f(a) —=nf(6), for all f€Sm, and then f(b) 0 
and b<o. Q.ED, 

If an ordered vector space A is Archimedean, and if, for any two elements a, bed 


there exists an element c such that a, b<<c, then a completion by cuts of A is a complete 


vector lattice A [6), p. 58; 7), p- 68]. Since 
a, b< (1/2) (e+4)*+ (1/2) (e+6)’em 


for any two a, b €m, we get 

Theorem 1. Let the completion by cuts of m be m, then m is a complete vector lattice 
and m is embedded in m, so that all g.l.b. and |.u.b. existing in m are preserved. 

Thus aém can be divided uniquely in m as follows : 


= a 
oe 


BI 0 ES de Ce 


In the usual theory the positive and negative parts of any field quantity are also field 
quantities in a rough sense. Therefore we also assume that a and a be field quantities 
in the following sense. 

Assumption 6. If a*3s0, aem, then there is an element a’€m such that a => d >6. 

At first sight this assumption will be seen to be short to express that a” is a physical 
quantity in some sense. But Lemma 3.4 will show that a gives a definite expectation 
value f(a*) in any feSm, and that, if f(a*) =0 for all feSm, then a*=o. Thus we 
shall be able to say that a* is a field quantity in this sense. From Assumption 6, it 
is clear that for any a>o, a€m, there exists a’€m such that a> a’>o. 

Lemma 3.3. For any aeim, there exists a directed set of points {a,} such that a,€m 
and a,’ a, and {a,} such that alem and a’ a. 

Proof. Let a>o. If a=o, the lemma is obvious. If a>o, there exists a,em 


such that a=a,>o0, and a—a,~o. If a—a,=o, the lemma is obvious. If a—a,>o 


> 
there exists 6,¢m such that a—a,—->6,>o0, and a—(a,+6,)>0. Put a,=a,+6,, then 


a,ém and aa,>a,>o. Repeating such procedures transfinitely, we can get a desired 
directed set of points {a,}. For any aein, there exists bem such that a->b, and let 
G } (a—6), then putting a.=b+¢, we can show a, fa. Q.E.D. 


Lemma 3.4. Defining f (a) for all feSm and a€M properly, we can make m and em 
dual to each other. 


Proof. For any feSm and aeim, we define 


fauy (a) =sup { f(a’); a’ <a, a’em}, 
and 


fine (a) =inf { f(a’); a’>a, adem}. 


There is an element bent such that b> a, then f (6) = f(a’), and it is clear that ae (a) 


On the Representations of Field Quantities 43 


exists. In the same manner, we can show that Faa(a) exists. -» Leth, ap. 6,21 and 
Age cecrandaput de y= (e,—<a) +: (a$,) =c,—6,. If we put («, 2) <(«’, ¢’) when 
and only when «<x! and ¢</’, then {(«,¢)} is a directed set and dix, yn \ 0. Thus 
Ax, »)—>0, namely {f(c,) —f(6,)}—>0, and it follows / fev f(b) and fry (a). = 
fir (2). Now we define that f(a) =f,,,,(a) =fine(a). From this we can easily show 
that f is a linear functional on m. Further we can show that for any fi, foeSm and 
a, peak 0) a+?=1, ; 


| (af, + Bf.) (a) =f, (a) +f. (a). 
Let feem be f=of,—ph=a' fi b' fi’, where a, [, a’, 2’ 0, and fs for ite f.'<Sm, 


then af, +8/f=a'f/+6'fr, and af,(a)-+8'F (@) =e'F! (a) +8F,(a), namely af, (a) 
—Pf,(a) =a’ f/ (a) —B’f,' (a). Therefore we can uniquely define 


F(a) =aF, (a) —SF.(a), | 
and feBm defines a linear functional on mm, and aem does on Wm. Let aein -and 
f(a) =0 for all fe@m, then there is at least one {a,} such that a,| a, a,€m. Since 
f (a) = f(a) =0, a,>0.. Hence ao. In the same way we can show ao. This 
proves a=—O. OED, 

Lemma 3.5. We can define a product ab for all a, bem such that m is a commutative 
algebra in which a, bo imply abo and m is a subalgebra of im. 
Proof. For any ao, a€m and beim, we put © 


(ab) ,,= V fab’; b' <5, bem}, 


and 


(aby tabl!; 6% 5. bl em. 


As mm is complete, the existence of these two elements are clear. Let 6,16 and 6,’ | 6, 
then (5) —b!) =by.5 0. It follows by, .—0, and aby, 0 by Lemma 2.3. As ao, 
this implies that ab, | 0, that is (ab//—ab,) | 0. This shows that (ab),=(ab);, and 
we can define ab=(ab),,=(ab),. Let ma, a,, a, 0, and m6, b,, 6,, then it is clear 


that 
(4,+ 4))b=a4,b+ a,b, a(b,+b,) =ab,+ab,, 


and (a,b)a,=a,(ba,). For any a€m, a=a,—a, and a,, a,—0, we can define ab=ba 
=a,b—a,b as in Lemma 3.4. Similarly we can define ab for any a, beim. Let a, bo, 


a,, 6<em. If a,) a, 6,16, we have 
/\ Cy V ab,= IX [a, (6—&) or (a,—a4) b, |= /\ dy (6—b,) =P A (4,—a) b,=0, 


and if a, a, & | 4, 
Va b— \abl=\/ [ai (6—b) + (a,—a) b,]< V a, (6—6bi) + V (a), —a) bi =o. 
As A a,b=\/d,b and \/ab,= /\ ab), this proves that the above definition of ab is unique. 


If abs co, then 
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(ab) c= (A 446) c= / [ (a,b) c]= / / [ (4,5) |= p (a, (be,) |= Ain (bc) =a(bc). 
x ; 
The remainder of the lemma can be shown easily. Q.E.D. 

In a vector lattice A an element a, is called a unit if and only if 4,Nb=o imply 
b=o for all b>>o, and for convenience’ sake we call this L-unit, and a unit in an algebra 
A-unit. In the beginning of this section we have pointed out that we would be able to 
compare any physical quantity a€[,, with ae. Now we are in a position to set up an 
assumption which will realize this proposition. Though 1 is not necessarily a vector 
lattice, this assumption will mean that e is something like an L-unit, namely we assume 

Assumption 7. For any a>o, aem, there is an elmenet a’>o, a’em such that a=a' 
and e>d'. 

From Assumptions 6 and 7, we can easily show that e is an L-unit of m. Let an 
element a€m satisfy the condition that there is a positive number @ such that a > f(a) 


>—a for all feSm, namely ae>a->—ae, then we call it bounded. Assumption 7 
_ 


means that for any a>o, aem, there exists a bounded element a’€m such that a= a’ >o, 
and we can show as in Lemma 3.3 that there is a sequence of bounded elements a,€m 
such that 4,74. Therefore Assumption 7 may be interpreted as that any element a€m 
can be approximated in some sense by a sequence of bounded elements. 

When a vector lattice A is a commutative algebra such that A-unit is L-unit and 
that a, b=>o imply abo, then A is called a lattice ordered ring [7), p. 69], and 

Theorem 2. im is a lattice ordered ring. 

In [7), Chap. IV], it is shown that, on a certain topological space 2, the set & 
of all continuous functions which are finite almost everywhere can be considered as a 
complete vector lattice, and that m and m can be represented by an ideal Yim of & and 
a subset 2m of Vin respectively. It is further shown that if F, be a function correspond- 
ing to a, then F,F,=F,, and F,=1. From this we can easily show that a°=o implies 
a=o, and that a=a'—a~ imply a‘a~=o. 

Lemma 3.6. For any ao, aeim, there exists {b,\, 6,em, such that b.>>o and 
(b,)? 7 a. 

Proof. F, is continuous, and so (F,)'?>>0 is also continuous. As F.+F, 
= 2(F,)? 0, and Sm is an ideal, (F,)'?€ Vin. Let a’? be the element corresponding 
to (F,)'”, then there exist 6,em such that 6, fa'?, and (6)? fa. Q.E.D. 


$4. Representation of 9 


In the last section we have studied the various properties of the real commutative 
subalgebras. Now, using these results, we can show that 2% can be represented by an 
operator algebra in a Hilbert space. We begin with the following theorem. 

Theorem 3. For any a€Y(, a*a>o, and a*a=o implies a=o. 

Proof. We first show that, if a*a—=—b and b>o, then b=o. Let m be a 
maximal real commutative subalgebra to which 6 belongs, then from Lemma 3.6, there 
exists {c} such that (c,)2->b, c,>o. and em. Let ag=utivy ug U€UAsas Ges 
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(ae,) (ae,) * + (ae) * (ae,) = 2| (ue)? + (v,)*| =o. 
On the other hand | 
(ac,) * (aq) = ca*ac,= — (¢,)*b <o. 
Hence 
Ga) (ae, * = a( Ga" 2-0: 
and as 33 is closed, aba* 0. Since 
' (aa*)* + aba* = aa*aa* + aba* = —aba* + aba* =o, 


(aa*)*, aba* =o imply (aa*)?=0, aa*=o0, then a*aa*a=o, a*a=o. Next, let a be 
any element of Y{, and im be any maximal real commutative subalgebra to which a*a 


belongs, and let 


aati as We 0,08 ee: 


If u-3<0, from Lemma 3.6 there exists v>o such that u>v?>o0, vem, and u*?=0. 
Therefore u-v’em and va*av=—v'u, namely (av)* (av) =—vu-. Thus v*u- =o, this 
is a contradiction, and accordingly u-=o0, a*a—o. The remainder of the theorem can 
be easily shown. Let a=u+iv, u, veY,. As a*a=o0, aa*aa*=(aa*)?=0, namely 
aa*=o. Thus a*a+aa*=2(u?+v’) =0, and u=v=o. Q.E.D. 

If we assumed a*a—o from the outset, we could extremely simplify the argument, 
but since the physical meaning of adjoint is not clear we have not assumed this. On 
the contrary, the Assumptions in § 3, from which we can prove Theorem 3, refer only 
to the self-adjoint elements, and their physical meanings have been explained in § 3. 
This is the reason why we have followed such a complicated course. 

Lemma 4.1. Put for any feS 


wyp= a3 f(a*a) =o}, 
then %, is a closed left-ideal of 2, and 
or= ita; fla) =o}. 
Proof. Let f(a*a) =o, and y(b)=f(a*ba). (6) is a real linear functional on Y,., 
and from Theorem 3 9(c?)=f((ca)*(ca)) =o for any cé,4, and g(e) =f(a*a) =0. 


It follows from Lemma 2.1 that 7(b) =f(a*ba) =0. Namely for any cel, f( (ca) * (ca) ) 
= f(a*c*ca) =o, and then AY,C Uy. For any a€yy, and ae, 


f(e) —af (a+a*) =f ((e—aa) * (e—aa)) = 0,. 


therefore f(a+a*) =o. In the same way, we have f(ia—ia*) =0, and hence f(a) 
= f(a*) =o. Hence for all dényy, f (Ma) =f(a*W) =o. On the contrary, f(a) =o 
implies aeQ,, then it is clear that, if a, beS,, at+beX, and yy is a subalgebra. If 
a,€%, and aa, f (2a,) =o and then f(a) =0. That is a€,¥,, and .¥; is closed. Q.E.D. 

Theorem 4. For any feG, there is defined a Hilbert space ,, and Y/, is a subset 
which is everywhere dense in §),. There is an operator algebra in \),, which is bomomorphic 


to 2, 
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= ‘ 
Proof. Let a, b,--- be elements of a linear space W/%, For any element be, we 


i $4; j=ba of these operators 
can define a linear operator 6, in 2U/.3; by b,a=ba, and let the set pe 


i i 2 2 ma 4.1 
be Y,. It is clear that bb, is a homomorphism of Y onto YW, From Lem 


CO 2a) (a+ %,)) =f(b*a), and we can define an inner product in at a, by (4, 4); 
=f(b*a), and it is easy to show that this is a proper inner product [8), p- ah — 
\ja|| =[ (4, 4) , |’ be a norm of a, and let the completion of W/G, be HD, then , is a 
Hilbert space. Mf, is a set of operators in Dy, and the domains of these operators are 


everywhere dense in ,, namely YI/ Sp Since 
(E, aj2) =f Bac) =f ((a*b) *d) = (#6, 2) = (a) B, 2) p 
a, and (a*), are adjoint to each other [8), p. 41]. Q.E.D. 

Theorem 5. is isomorphic to an operator algebra in a Hilbert space 9. 

Proof. Let D=@{H,; feS} be the external Hilbertian direct sum of {9,3 feS} 
(5), Chap. V, p: 146]. For any ae, let ds be the operator in which can be 
identified with a, in each §,, and whose domain , is the direct sum of {Y/.\,} with 
the condition that only finite number of components are not 0. Put the set of all these 
operators Y{ 5, then it is clear that 2{, is homomorphic to YW. Let ag=o, then for all 
fe and be, ab=o in ,. Putting b=e, we get f(a*a)=o. This shows that 
a" d=0- a= 0. Q.E.D. 

Lemma 4.2. Put 9(b) = (4, 6,4); for any a€Ml/X,, a0, then (1/9(e))g (6) €S. 

Proof. From the continuity of the product, it follows that g(6) is a linear con- 
tinuous functional on Y,,, that is 7 (6) €%§,, and then the lemma is obvious. Q.E.D. 

In Theorem 5, is not necessarily separable, but in quantum mechanics the state 
vectors form a separable Hilbert space, and if it is not separable there will occur many 
troubles. Therefore we expect that by appropriate physical assumptions can be made 
separable. In the following, we make an additional assumption from such points of view, 
but we may say that it is too artificial and that its physical meaning is not sufficiently 
clear. The main purpose of the following discussion is to show that the separability of 
is closely related to the physical viewpoint. Therefore, in the following sections, we 
do not use the Assumption 8) but the separability of . 

We can make only an enumerable number of experiments, and f(a) can be determin- 
ed by these experiments. Thus the number of f(a) which we can determine by experi- 
ments is at most enumerable. Hence we may assume that there is an enumerable set 
Y. which is everywhere dense in Y{ in a sense. Considering that YI is not only a vector 
space but an algebra, we assume 

Assumption 8. i) There is an enumerable set 2, of elements of YU such that for any 
a€W there are a,€%,, n=1, 2, «++, which satisfy the conditions that a,—>a and a*a,—>a*a 
in o (ML, %). 

On the same ground we assume also 
ii) There is an enumerable set ©, of elements of S such that f,(a) =0 for all feS, imply 
f(a) =0 for all feS. 


Lemma 4:3. 5, is separable, 
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Proof. Let (2(/%,). be the enumerable set of classes each of which contains at 
least one element of Y{.. From Assumption 8, i) it is easy to show that for any ael/(, 
therevareraye (Q0/ S};)i, (n= 1,25 sp esuchethate (a, 2 (4, b), for all be QQU/%,) and 
that |ia,|!|—]ia|l. As Y/Y, is everywhere dense in ,, this shows that d@,—>da in the 
strong topology in ,, namely ||a—a,||>0. Thus (U/N,)_ is everywhere dense in Dy 

QED. 

From Lemma 4.3 and Assumption 8, ii) we can easily show, like Theorem 5, that 

Theorem 6. Let H=Q{H,3 feS.} be the external Hilbertian direct sum of {Dy5 
feS.}, then H is separable, and Y is isomorphic to an operator algebra in §. 


§ 5. Representation of states 


In the last section we have shown that Y{ can be represented isomorphically as an 
operator algebra Y{~ in a separable Hilbert space . Now, in this section, we shall study 
more precisely the relations between 9{ and 2, and especially the representation of states. 

Let a, be the operator in {) which corresponds to aeY{, then it is clear that the 
domain Sy of a; is everywhere dense in . ({), is defined as in the proof of Theorem 
53) ; 

Lemma 5.1. If a=a*, as is symmetric [8), p. 49]. 


Proof. It is clear from the definition of a symmetric transformation in {. 


Q.E.D. 
Lemma 5-2. If ae, is bounded, then there is a unique self-adjoint and bounded 
extension a. of 4 5. If ao, a€Y,,, then there is a self-adjoint extension ag, of de, and 


the spectrum of ag, is non-negative. 

Proof. From Lemma 4.2, if ¢(a) = (6, ayb) and (6, b) ,=1, geS. Then ae>a 
> —ae implies a(¢, ¢) =| (Y, a) | for any element 4/€§),, and as a, is symmetric 
there is a unique self-adjoint and bounded extension as 8), pp oO, 27..365 |.. Onrthe 
other hand, if ao, (¢, a¢ of) 20 for any element ¢/¢€{,, and the remainder of the 
lemma follows at once [8), pp. 188, 388]. Q.E.D. 

Lemma 5.3. Let B! be the set of all bounded elements of , 8 be the subalgebra of 
MY generated by BW’, and the corresponding subalgebra of {5 be Beo- Then any operator 
a5€B¢ is bounded and has a unique bounded extension a. with domain §). Furthermore, the 
set of all these a forms an algebra © and it is isomorphic to Bs. 

Proof. If ach’, then there exists a unique bounded extension 4, with domain 
[8), p. 57]. Therefore any element of Cg, which is generated by these ag, is bounded 
with domain , and then it is continuous. This implies that © is isomorphic to Bg. 
For example, let ab=ba, agbg=beageBg. For any #€, there are ¢,€,, n=1, 2,:--, 


such that 4,4 in the strong topology in . As bs a and ba, are continuous, 


Abo Yn>dghg and bsagdn bsagy, thus agbsP=bs 544, namely dsb =b sag. Q.E.D. 


As we have shown, if a is bounded, ds, is also bounded. If ao, there is a self- 


adjoint extension dg, whose spectrum is non- negative On the contrary, we do not know 
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whether the inverses of these propositions can be proved or not. This is due to the 
following fact. From Lemma 4.2, it is easy to show that if (€, and (¢, ¢/) =1 then 
fOD=VY, ag) e@. It has not been answered as yet, on the other hand, whether any 
state f can be represented in the form f(a) =(¢, 4g¢4) or by a linear combination of 
these forms. Thus if f(a) 0 for all feS, then (¢, dg) = 0 for all ¢€,, but the 
latter does not necessarily imply the former. Therefore in the following we shall study 
this problem. 

Let m be any real commutative subalgebra, m’=mnW, n’’=mn&, and the 
corresponding sets of elements of &¢ be 1, and 1¢ respectively. Then the weak closure 
of 1g is a Neumann’s Abelian ring its [9), pp. 388, 398], and it is the smallest ring 
containing 115 of 1g, namely tg = R (1g) =R(ng). It is shown in [9), p. 401] that 
there is a bounded self-adjoint operator H such that ni=R(HA). In other words, any 
element of 1g can be represented by a bounded H-measurable function F(/) as F(H), 
whereas any such operator F(H) belongs to nz [8), p. 227; 10), p. 213]. Then we 
can show 

Lemma 5.4. 1% is a complete vector lattice by letting F, > F, mean that F,(a)> F(a) 
almost everywhere. 

Proof. Let tg=R(A ) and E(/) be the resolution of the identity corresponding 
to H, and let Mt and Y be two closed linear manifolds defined in |8), p. 189]. Let 
$i, 1=1, 2,---, and @,, kR=1, 2,-:-, be elements of § defined in [8), pp- 247, 250]. 
Then we consider an element ¢/, of such that 


P= Ait SPP a0, B,2<0, k= 1. 2,°°° 


> 
Any element ¢/€{} can be expanded as follows, 
a A; ' N 
P= DTPA DIOS MEDUHs), PEM Gr), 
a k 


and, as E(A)¢/, E(A) oj are mutually orthogonal, 


NEA) MP = SPE MEF SA PEC eur. 
a &k 


From |[8), p. 250], it is easy to show that there are functions F,(4) and F,(4) such 
that 


. ra % 
DPE Silk =) F @d\\EC (ag) |P, 


NG aI 
= 
a 


N Jo . rie ha 5 5 
4 EC) gilt =| FA dE (Be I% 
=o k 


wie the conditions that F,(2)=0 in the point spectrum of H, whereas F,(4) <0 only 
in the point spectrum. Then 


t The O-messbarkeit in 10) can be replaced by the H-measurability in 8), 


Si and turthermor € 
the condition that F(0)=0 in 10) is not necessary. eeankail 
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IE@sHP=| EA +P dle DG 


namely (A) =|IE(A) PIP <p,(A) =|IE() IP, [8), p. 214]. Therefore H-measurable is 
equivalent to (-mecsurable, and, as H is bounded, it will suffice to consider a finite 
interval [@, 9] instead of QR. It is well-known that the set of all bounded and Por 
measurable functions F(a) modulo null functions is a complete vector lattice [6), p. 241], 
namely, tig is a complete vector lattice. Q.E.D: 

From Lemma 5.3, we can easily show that the algebra ng is isomorphic to the 
algebra nv’. Now, we can extend this as follows. 

Lemma 5.5. Ns is isomorphic to mv/’ as an ordered vector space. 

Proof. It has been shown in Lemma 5.2 that a>o (or a>o) implies ag = >o 
(or as >o). Let a, =o and a==o0. From Assumption 6, it is easy to show that 


there is an element bei’ such that 6b>o and ab<o. Thus bs >o and asbs <0. It 


is obvious that 1g is a lattice ordered ring, therefore ag >o and b,>0 imply ashy 0. 
This is a contradiction, and so ag = o implies ao. Namely, 1g is isomorphic to m1’ 
as an ordered vector space. O'E.D: 

From this it is easy to show that in’/=nt’, and also what follows. 

Theorem 7. Let the completion by cuts of mv’ be wv’, then im’ can be identified with 
the set of all bounded elements of m, and it is isomorphic to tg. 

Proof. It is obvious that the completion by cuts of ng is contained in tg. As 
ny can be constructed from ng by adding all the limits of strongly convergent sequences 
in 1g |9), p. 398], we can show that, for any element F(A) eitg, there is a sequence 
4F,,(H1)}, n=1, 2, ---, of elements of Ig such that F,,(A) are uniformly bounded and 
F,,(A) >F (A) almost everywhere [10), p. 210]. Then we can define a function F,(A) 
by F,(4) = ‘ iv F,,(A)}. For any €>0, there is a measurable set M.cla, f| such 


ma 


that the /-measure of |a, 8]—M., is smaller than &, and {F,,(A)} converges to F(A) 
uniformly on M, [11), p. 88, Egoroff’s theorem]. Thus, for any 0 >0 there is a number 
N such that, if m=>N, |F,,(A)—F(A)|<9 on M., and then |F,(A) —F(4)|<0 
on M.. As we can choose & and 0 arbitrarily, F,(A) =F (A) almost everywhere, namely 
F(A)=/ {\V/F,,(4)}. This means that the completion by cuts of 1g is itg- Therefore, 


if we identify the corresponding elements of Mm and tg, we get 
aan aa Hit “lf A) vf pa ” 
mom’ =ig Dm’ =ng. 


As any element of tg is bounded, so is the corresponding element of nV’, Thus, the 
set of all bounded elements of in is nv’, Q.E.D; 
This Theorem 7 and Lemma 3.4 show that any state fe@,, can be considered as 
a linear functional on tg, and we get 
Lemma 5.6. Considering any feS,, to be a linear functional on ig, we can show that 
there is an element ¢),€§) such that fF (A)) = %p F(A) )), for all F(A) eng 


{ In 6), the theorem is slightly different but the modification is easy. 
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Proof. Let the characteristic function be Fy(4) for any (- -measurable subset M of 
], then it is clear that Fy,(H) Ells. Put f(M) =f (Fu(H)), then f(M) is a non- 
From Assumptions 5 and 6, f is countably additive, and so f is 
Let p,(A) be the corresponding function, then it is 


[a, B 
negative set function. 
a measure on [a@, | [11), p. 30]. 
easy to show that, for any F (H) eng [11), pp. 85, 112], 


F(F(H)) = F (2) dp,(2). 


On the other hand, as f is absolutely continuous with respect to /-measure, there is a 
measurable function F; (A) = 0 such that [8), p. 215; 11), p. 124] 


0/0) =| FHA) dy (A). 
Thus 
FE (H)) =| FOFAdEO Fo 0), 
and similarly 


FE (H)) =| PFA d EC Fo 4). 
s | FA deo @® =r(+ 00) =1, we can define F,(H)¢,=¥,€D [8), p. 226], and 
FFH)) =| FQCE@¢, 4), 


FF (H)) =| POdE@ ty yy). 


These imply that ¢/, is in the domain of F(H) and f(F(A)) = ((',, F(A) ¢,). Q.E.D. 
Theorem 8. im can be represented by a vector lattice of H-measurable functions, and 
for any feS., 
f(a) = (Up F(A) f), 
where F(A) is the corresponding function of ae. F,(H) is self-adjoint, and if F(A) is 
bounded, so is a. 
Proof. Let aeim, a>o, and a,=aNne, n=1, 2,::-, then a, fa, a,eim’’. Let the 
cits functions of a, be F,,(A), then there is an extended real valued function 
F(A) such that die F,,(4) =F,,(A) almost everywhere [11), p. 2]. As F,(A4) is H- 
measurable, F,, (i). is sige H-measurable [8), p. 227; 11), p- 84]. Let f, 0,(4), and 
(, be as in Lemma 5.6, then, for any f(A) [11), p. 112], 
lim | F(A dp/(d) =F.) dp (2). 


n> + 


As Fan) =| F(A dp, (A) and f(a,)>f(a), F,(4) is integrable and 


Fl) = | FAdyp@ =| F Ad EA Sp. 
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In the same way we can show that f(a =| PAd|E® dh|\’ exists, and so ¢, is in 
the domain of F,(H) [8), p. 222]. Thus. 


Fl) = | Fe dey) =n Fa(H) 4p). 


IfPa o: put a=a'—a, and F,(4) =F} (A)—F7 (4). As F,(A) is real, F,(H) is 
self-adjoint [8), p. 232], and f(a) =(,, F,(H)¢,). The remainder of the theorem is 
obvious. Q.E.D. 


§ 6. Concluding remarks 


On the Assumptions (1~8), we have shown that the set of field quantities can 
be represented by an algebra of operators in a separable Hilbert space. It goes without 
saying that there may be a case in which some of our assumptions are not fulfilled. 
We can assume, for instance, that for any two field quantities the product is not 
necessarily defined. Even in such a case, however, it will be natural to assume that 
some subsets of field quantities satisfy the assumptions in this paper, and adding some 
other appropriate assumptions, we shall be able to show that the set of field quantities 
can be represented by a subset of operators in a separable Hilbert space. Anyhow, it 
will not be necessary to consider a case in which field quantities are essentially different 
from the operators in a separable Hilbert space. 

On the other hand, we have shown in §5 that, on any m, feS m can be represent- 
ed by f(a) =(¢, ag¢,), $¢D. But, on YY, we do not know whether any state f can 
be represented in like manner or not’. We may say in a rough sense that, in quantum 
mechanics, a pure state can be determined by its behavior on mm, and that any state 
can be represented by these pure states. From the physical point of view, therefore, it 
will be quite natural to assume that f(a) = (4, 4547) for any ae. 

Throughout the present paper we have not touched upon the physical meaning of 
states and field quantities themselves. For example, a state may be a state at one time 
as in the Schrodinger representation or a state independent of time as in the Heisenberg 
representation or a state of some other meaning. We only assert that, when states and 
field quantities satisfy the Assumptions ( 1~8), these field quantities can be represented 
by operators in a separable Hilbert space. Therefore, we may construct many different 
physical theories in this framework. In the next paper we shall mainly consider this 


problem and study the characteristics of field theories. 


The author wishes to express his sincere thanks to Prof. F. Maeda, Prof. T. 
Ogasawara, and Prof. U. Sasaki for valuable discussions about this work, and is also 
grateful to Prof. Y. Mimura, Prof. H. Takeno, and Dr. M. Ikeda for their kind interest 


in this work. 


t As any element of 2 is not necessarily bounded, there may be a case where ¢y€D is not in the 


domain of 4g. 


52 


1) 


H. Wakita 
References 


R. Haag, Danske Vid. Selsk. Mat.-Fys. Medd. 29 (1955), no. 12; Math. Rev. 17 (1956), 112 
(F. J. Dyson). 

K. O. Friedrichs, Mathematical aspects of the quantum theory of fields, Interscience, New York (1953). 
I. E. Segal, Ann. of Math. 48 (1947), 930. 

E. Hille, Functional analysis and semi-groups, Amer. Math. Soc. Collog. Publ., New York (1948). 
N. Bourbaki, Espaces vectoriels topologiques, Hermann, Paris (1953, 1955). 

G. Birkhoff, Lattice theory, Amer. Math. Soc. Collog. Publ., New York (1948). 

T. Ogasawara, Sokuron II (in Japanese), Iwanami, Tokyo (1948). 

M. H. Stone, Linear transformations in Hilbert space and their applications to analysis, Amer. Math. 
Soc. Collog. Publ., New York (1932). 

J. v. Neumann, Math. Ann. 102 (1929), 370. 

J. v. Neumann, Ann. of Math. 32 (1931), 191. 

P. R. Halmos, Measure Theory, Van Nostrand, New York (1950). 


53 


Progress of Theoretical Physics, Vol. 20, No. 1, July 1958 


Theory of Line-Shapes of the Exciton Absorption Bands 


Yutaka TOYOZAWA 


Research Institute for Fundamental Physics, Kyoto University, Kyoto 
(Received April 12, 1958) 


A general theory of line-shapes of the exciton absorption bands is developed with the help of 
generating function method. When the exciton-lattice coupling is weak, and the exciton effective mass 
is small, the absorption band is of a Lorentzian shape, provided that the temperature T is not too 
high. The half-value width H is given by the level broadening of the optically produced K=O ex- 
citon due to lattice scattering, so that it is proportional to T except at low temperatures. If the 
coupling is strong, or the exciton effective mass is large, or the temperature is very high, the absorp- 


tion band is expected to be of a Gaussian shape, and H is proportional to VT. The mutual influence 
of adjacent absorption bands is also discussed ; it causes the asymmetry and repulsion of the compo- 
nents as temperature rises. 

If we replace T by the density of lattice imperfections, the above statements are valid, without 
substantial modifications, as regards the dependence on the degree of imperfections. 

These conclusions are in qualitative agreement with experimental data. The comparison further 
provides us with information on the strength of the exciton-lattice coupling and the energy band 
structure of the exciton. 


§ 1. Introduction 


Since the “‘ exciton” model was first introduced by Frenkel,’ and was applied to 
the explanation of several sharp peaks which are observed on the low energy sides of 
characteristic absorption bands in typical insulating crystals, most of theoretical attention 
was directed to the electronic structure of the exciton.”—*) Results of calculations can 
be compared with the experimental data, as regards peak positions,” oscillator strengths,” 
multiplet structures” and so on. However, the observed peak position corresponds to 
the energy of the exciton with wave number K nearly equal to zero, so that it does 
not directly give us the information on the energy band structure of the exciton. 

On the other hand, the problem of exciton-phonon interaction has been attacked by 
several authors; some dealt with the stationary state problem of the exciton phonon 
system,” while others calculated the probability for the scattering of an exciton by 
phonons.” To relate these results directly with experimental data seems rather difficult, 
at least in the present stage. 

As for the line-shapes of the exciton absorption bands, however, it does not seem 


that remarkable progress bas been made on the theoretical side since the pioneer work 
by Peierls.» It is the purpose of the present paper to develop a general theory of the 
line-shape in such a way that as much information as possible is obtained through com- 
parison with experiments. We also wish that it could contribute to inter-relating more 


closely the theoretical and experimental works stated above. 
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In § 2 of this paper, the exciton-lattice interaction Hamiltonian H,, is derived with 
Wannier’s exciton wave function, including the effective mass approximation as a special 
case. A general formula for the optical absorption coeficient of this electron aang ie 
lattice system is derived, in § 3, as a function of the radiation frequency. The Fourier 
transform of the absorption coefficient, that is, the generating function, is expanded into 
an infinite power series H,,. In the two extreme cases, that is, weak and strong limits 
of the coupling H,,, the series can be brought into simpler expressions. Thus, in the 
limit of weak coupling (§ 4), the absorption band tends to a Lorentzian shape due to 
the motional effect of the exciton. In the opposite limit the band takes a Gaussian 
shape, which means that the “localized exciton” model is valid in this case (§5). 
In the intermediate region, the absorption band may show an asymmetry, depending upon 
the energy band structure of the exciton. 

When two or more exciton absorption bands lie very close to each other, the 
influence of each component on the strengths, peak positions and shape asymmetries of 
other components becomes important, as is discussed in § 6. 

§ 7 is devoted to the derivation of criterion for the appearance of two extreme cases 
in a more explicit form, and to the arrangement of theoretical conclusions so as to be 
convenient for comparison with experimental data. In § 8 we carried out the analysis 
of a few of experimental data available, which proves to be in satisfactory agreement 
with theoretical expectations. 

Further information is obtained through this comparison: one is the strength of the 
exciton-lattice coupling, and the other concerns the energy band structure of the exciton. 
They will be utilized, in the next paper, for speculative investigations of the various 
dynamical processes which the exciton possibly suffers.’ 


§2. The derivation of exciton-lattice interaction Hamiltonian 


Let us consider an insulating crystal consisting of N atoms and N valence electrons, 
and denote the Wannier functions” for valence and conduction bands by a,(x%, m) and 


a,(x, n), respectively, where m and n refer to lattice sites. 


In the approximation of one electron excitation, the wave function for an exciton 


with inner quantum number 4 and translational wave-number K is given by” 
Bars NUS Si exp (iK-m) SU x (D A(m, m+), (2-1) 


where A(m, m) is the Slater determinant of N electrons for the configuration | a,(x, 1), 
d,(%, 2), ++ a,(x,m—1), a,(a, n), 4,(%, m+ 1)-++, a,(x, N)], that is, the configuration 
in which the valence electron of the m-th site is excited 


into the conduction band, at 
the n-th site. 


The wave function U,;-(U) for the relative motion of the electron and 
the hole is to be determined from Wanniet’s difference equation, with Hamiltonian 
FA.,=T-+V, where T is the sum of the energies of the electron and the hole, and V 
denotes Coulomb and exchange interactions between them. 


If we make use of the effective mass approximation, the kinetic term T, as an 


operator on U,, (1), can be replaced by a differential operator W”(—i-0/dl, K), 


where 
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W(k, K) =—W,(k—K) +W,(k), a 2-2) 


and W,,(k), W.(k) mean the one electron energies, with wave number k, of the valence 
and conduction bands, respectively. For a fixed value of K, one can expand W(k, K) 
around a minimum point k,,(K) : 


W(k, K)=J(K) + (k—-k,, (K)) -#/2p(K) -(k—-k,(K)) ++, (2-3) 


the coefficient 4(K) being a reduced mass tensor. If we transform the wave function 


by 
Prx (Ll) =H Ol (OD, Cs 
T is replaced by 
J UK) —9/01-6?/2(K) -0/ol. (2-5) 
The potential part V, in this new representation, is written as 
(|V| 0) 
ig eK) a1) Dirks \ \ BSL 
x e? /|x—x!|-4,* (x’, Da,(x’, 0+) dxdx’ 
+ ek, (KE) I-V) see || a, (x20) a(x, 2) 
p 


Xx e?/|x—x'|-a,* (x, I) a(x’, 0+) dxdx’. (2-6) 


It reduces to a Coulomb attraction —0d,,e*/|/| in the crudest approximation, as is well 


known. 
If we make further simplification by taking parabolic approximation for both of the 


energy bands: 
W (hk) =— (k—k,) -6?/2m,: (k—k,), 


(272) 
W .(k) =€,+ (k—k,) ‘7? /2m, ° (le== kp), 
we have 
k,, (K) = Le k, K)+ rd ier 
J(K) =, 4 K*-#?/2m*- K*, 
where 


m* =m,+m,, pol =m,'+m;', 
Pr = em, =m,-m*", Pe = fermz'=m,-m*—", 
K*=K—K,, K,=k,—hy. (2:9) 


The result is shown in Fig. 1 schematically. Because of symmetry, of course, there are 


actually equivalent branches on the opposite sides. 
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of N-electron system. 


Fig. 1. Energy level scheme in the effective mass approximation. 


We now consider the change of the potential energy between electrons and lattice, 
caused by an arbitrary lattice deformation: it is the sum of N equivalent one-electron 


energies : 
x 
0H, (%,, M59) 25 Xx) = > 0H, (x). (2-10) 
i=1 
Making use of the relation 


(A(m, n) |OH,|A(m’, n’)) = — Baar | a m) OH, X a,* (x, m’) dx 


ao a5 \ ves (x, n) OH, a. (x, n’) dx, 


and then transforming the Wannier functions into the Bloch orbitals 6,(x, k) and 6, (x, 
k), we have 
(Pac |OH|P yrcr) 
=a Ne! 2131310 K* (DU ne (UV) exp {i( K+ k) (i—V)} 


xX £,(k+K—K’, k) + NUS Vix* D Unew U) 


X exp {i(K— K’)l'+ik(l—U)} 5. (k, k—K-+K’), (2-11) 
where 
E, (ht, b’) =| (x, R)OH, (x) bg (x, k’) de (2-12) 


is the usual scattering matrix element of an electron-lattice system. 
The lattice distortion, which causes OH,(x), may be an instantaneous position of 
thermal vibration, or a static deformation due to any kind of lattice imperfections such 
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as dislocations. For the acoustic mcde of vibration we can take (i) the deformable atom 
approximation due to Bloch and Bethe’ or (ii) the deformation potential introduced by 
Bardeen and Shockley," as the case may be. In the case of optical mode of vibration 
in ionic crystals, OH, is nothing but the electrostatic field due to the polarization of the 
lattice, and was given by (iii) Frohlich e¢ al." Throughout these three cases (2-12) 
depends only on (k—k’), that is, &,(k, k’)=6,(k—k’), and (2-11) is written as 


—q,(AK, #K')é,(K—K’) +9,(4K, 2K')é,(K—K’). (213) 


If one takes the effective mass approximation (2:2) ~(2-9), which is tacitly assumed 
also when one calculates (2-12) in the three cases (i) ~(iii) stated above, g can be 
expressed as 


qh (AK, 4K’) = Bs Uxn* DQ Uy nr YD 


= 2K (L) ¢), (1) exp{ —i(K—K’) -p, 0} = q,(K—K’; 4’), 
(2-14) 
Ve (AK, i K’) = >i Un xK* (L) Unrxr (Ll) exp {i K — K’) -U} 
7 


= D9." (DO Ga D exp{+i(K—K’) -p.-l} = q.(K—K’; 22’). 

We now introduce, for convenience, the creation-annihilation operators 4) ,4*, a) % 
for the (AK)-exciton, whose energy we denote by &,,, and the creation-annihilation 
operators b,,,.°, 6,,, for the (w)-phonon with energy quantum bw,,,, where 4 and w 
refer to the mode and the wave number of the phonon. a and a™ satisfy approximately 
the commutation relations for bosons, but this is immaterial as far as we confine our- 
selves to the electronic states in which the total number of excitons is zero or unity, as 
we do in later discussions. The Hamiltonian of the electron system and that of the 
lattice system are now written as 

A,= OK AK AK 
is (2-15) 
A= DFO p20 Digi Bivens 


respectively, whereas the interaction between them is written, in linear approximation but 


otherwise quite generally, as 


Ai, a ps) a, 2a i(AK|f,,|//K’) (by, K-K! = bE kK + Kk) ann Arhr- (2 : 16) 
Be Kr 


In the three cases (i)~(iii) stated above, we have explicit expressions for the 
matrix element (2:12), and (AK|f,|//K’) depends on K and K’ only through K—K’ 
as is seen from (2-13) and (2-14). If we write (AK|(,,|4’K’) =§,(K—K’; 42’), it 
is given by 

Bac (w ; Ad’) = (26/9NMu)"" 9” {ga (w 5 Ad") C, +90 (w 5 44) C5, 
Bop (w ; Ad’) = {2rbwe?/Nwy- (1/#,—1/«)} 1? 1/m- {—gy(w 3 42") +9.(w ; 44}, 
(2-17) 
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for the acoustic and optical modes, respectively. In case (i) C is nothing but the kinetic 
energy of the Bloch function, whereas in case (ii) —2C/3 is the so-called deformation 
potential, that is, the energy change of the band bottom (or the top) due to unit dila- 
tion of lattice. M and 7, denote the mass and the volume of a unit cell, « and x, are 
static and optical dielectric constants. u is an average sound velocity and w/2z is the 
frequency of optical vibration, both referring to longitudinal waves. Ansel’m and Firsov'” 
used (2:17) for the calculation of the mean free path of an exciton, confining them- 
selves to the case /4//(=1s-state), that is, to the intra band scattering, whereas Haken" 
took into account the interband term /4// for the calculation of the self-energy of an 
exciton, Note that we can use (2-17) with (2-14), even when k,~=k,#0. 

g(w) in (2-14) is the Fourier transform of the charge distributions of the electron 
or the hole in the internal motion, because p,-1, for instance, is the electron coordinate 
relative to the center of mass. It represents the effectivity of the electron or hole 
charge for a particular phonon w. Now, q(w, 7/’) tends to unity or zero as » tends 
to zero, according as A=4’ or 44’, while, in any case, it becomes very small when the 
wave length 27/w of the phonon is smaller than the radius of the exciton, that is, the 
mean distance between the electron and the hole. For example, in the case of the 1s- 


state of the relative motion with decay constant a, we have 
ge(w ; 151s) = {14+ (p,w/2a)*}~°. (2-18) 


(2-17) tells us that the optical mode is probably less important than the acoustic mode 
for the intraband scattering of an exciton, for ,,,(w) tends to zero as » when w tends 
(w) ocw 
for the case of a single electron. To the interband scattering, however, both modes 


to zero, whereas [,,(w) tends to zero as w'”; further it is in contrast with 7.,,, 
would make comparable contributions, because only the phonons with finite » would play 


a role in this case, due to the energy conservation, 


§ 3. A general formula for the absorption coefficient 


The Hamiltonian for the electron-lattice system is written as 
H=H,+H,+H,,, (3-1) 


where H,, H, and H,, are given by (2-15) and (2-16). We now derive the coefficient 
of the optical absorption of this system which corresponds to the creation of an exciton. 

Consider a radiation field R in the crystal medium which is assumed to be isotropic 
with refractive index n. Its Hamiltonian is given by 


A,= >3; bw x Cox” Cok, OK= (c n) (3 : 2) 


ok 


where o(=1, 2) and K refer to the polarization (the directions being denoted by unit 
vectors €,,) and wave-number of the photon, and c*, and c,,4 mean the creation-annihilation 
operators. c is the light velocity in vacuum. Strictly speaking, n is not constant in the 
frequency range we are considering ; on the contrary, the absorption itself contributes to 


the dispersion of n. The most satisfactory way would be to determine n and absorption 
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ccefficient self-consistently. To avoid this rather complicated preblem of dispersion rela- 
tion, we take n to be constant over a range of frequency and equal to an appropriate 
average value. 

It is sufficient, for the present purpose, to take into account the interaction of R 
with the electron system alone, H,,, because in the frequency range at issue (w~10" 
sec’), the vibration of lattice cannot follow that of the radiation field. Thus the total 


Hamiltonian is written as 
A, =H +A, FAR: (3 3) 


Denoting the number of (@K)-photons by m,,;, the matrix element of H,, which cor- 
responds to the absorption of a (oK)-photon with simultaneous creation of a (AK)- 


exciton is calculated as 


(Prac POU S eaten oe | Hen Ps “ts Mok >» =) 


a} 


=ieb/me(27bemy ¢ /vynK)"” (Cone Barc)» (224) 
where 


N ; 
&aK = ld ee iat VY \dx,---dx y 
4=1 
aN ee (D) | a (x, DP eiK*a,(x, 0) dx. (3-5) 


Let us now assume that initially the electron system is in the ground state Y, (no 
exciton), the lattice system in state @, and the radiation field in state X,,. At the 


time T, the wave function of the total system is given by 


eH TOY, DX 
7p 
23-i(H+Hre) T/b[y —i/p| f+ Hasls H,,e~?A+ Aw) 4/6 dt, + cette 1% xen 


0 


(3-6) 


In the lowest order of perturbation, let us take, out of (3-6), the term correspond- 
ing to the state in which a (7K)-photon is absorbed and an exciton (with any 4) is 


created : 


e/me- (2mbcm, x/vgnK) 2 ei (mow -V) Ook AS (Con Bax) 
“A 
x | eiWoKk ty e—iH(T—t)/b pe tH lb Pde, Kits ng xml (3 : 7) 
0 


Taking the absolute square of (3:7), and averaging over the initial distribution n= 


(+++, mw, ++) of vibrational states, we have, as the probability for that state to be realized, 
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W (mex 5 Js ra (e/me)* 2themoK/vynK : pap (€,%° 2) Kk") (CcxK Zirk) 


aw 


T T-t 
x ae | dt elo tS (AK |e~iHt/® |7/K) eff tl}. (3-8) 
0 —4 
Here 
{++} gy = Tr, [e~ PH: {---}]/Te, (e PH), (3-9) 


a a. -1 P 
Tr, referring to the lattice system alone, and (xj)~’ being the sbnolnt teapemiaee: 


When T is large enough in (3-8), most of the contributions to | dt, come from 


0 


large values of ¢, and T—t,, so that we can replace the upper and lower bound of the 
second integration by +00. Dividing by Tm,,, the lifetime = of (¢K)-photon is 
given by 
ie (cK) a (e/mc)?* 27h /Uyn WoK os 2 (€.% i £.K*) (C.% ‘Buk) 
pe y 
x | a eiMoK tS (AK |e—#Ht/b 17K) etHxt Bice. (3-10) 


The absorption coefficient is defined as the reciprocal of penetration depth for which 


it 


the radiation attenuates to e~'. It is given, for the frequency o=cK/n, by 


A(w) =n/c-1/2 ->}1/7(o, K), (3-11) 


where average was taken for the two directions of polarization. 
(3-10) is nothing but the method of generating function (or the method of Fourier 
transform) which was applied to the radiative and non-radiative transitions of a trapped 
17) ,18) 


electron. In the present case, however, we cannot generally use the adiabatic ap- 


proximation for the electron-lattice system H, because the exciton energy levels constitute 
continuous spectra due to the translational motion. We proceed in another way, and 
expand the exponential in (3-10) : 


t ta-1 
da i st Se ao (ib) "| dt,-- \ dt, H'(t,)--H'(t,), (3-12) 
0 0 
where 
Hy) = ef (He+Hy)tlb py e—' (He +H, t/b 
=F 513) 3) Buaar (00) ef Cane ree Ear ac) e/b 
Bh AA Awe 
xX ay K 4 Arg (burw €t@nrot — Or and ef@uret) . (3 , 13) 
Defining 


Fy, (K) = 1/2 ps (Con Brxn*) (Con Brn) (3-14) 
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we have an infinite series expression for the absorption coefficient : 


A(w) = 27he?/mvycnox »S) 5) Fy), (K) 
rane 


+0 
x a elwt 18, i tb DULG AW elo) (3 715) 


where 


t tn-1 
UC: 12S) = Gia) of diy | des@K A (1) A (t,) |B) 4. (3-16) 
0) 0 
As H’(t) of (3-13) is linear in the lattice co-ordinates bu» and 6,,,.*, U, vanishes 
for odd n. The first two terms are given by 


U,.(63 ns) BK). =0;5,,, (3-17) 


t t 


Us(t5 ii! 5 K) = —*| de | de, 3S) Burns (0) 
) & At w 


0 c 


x Bua vw) ef (EK —ExK) A /B oi (Exre —Ear We 4) 0/8 
1 


x { (CAS e 1) etOpew TL es efOpee Th (3 . 18) 


The diagonal (A=/’) term is further reduced to 
t 
Us(t5 ahs K) =| (#2) de 313339 Pena (00) 
pe 1 uw 


0 


x ef (aK Ed K +e) he. { Chet 1) etOpet tn, et i@uw th (3 5 19) 
t +o 
- —9*\de | GEG =<) Pdf (By, (3-19) 


0 —o 
where 
fix (E) = pia! ra pal | Pur a (w) |? { i Miota ss 1) 0 (E), Bee eats Ola = 2) 
he 1 Ww 


+ tye 9 (Ex K 40 — One — OK =F) } . G ; 20) 


§ 4. The weak coupling limit 
Consider a continuous function f(E), the value of which is assumed not to vary 
appreciably over the range JE, or more specifically, |f’ (E) HE| —f (E);,.that 1s’: 
b/4E = 7. (E) = bf! (E) /f(E). (4-1) 
For a value of |f| much larger than t,, we have an asymptotic formula" 


t 


| de \ e—itcibf(E) dE ~ | {+720 (E) —iP(1/E)}f(E)dE (|t|>r<), (4-2) 
J 
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where -- correspond to fS0. Thus (3-19’) becomes 
ee ee 
Us(ts tas K)=— i? [de | ab iB *I¥ (0+ 89/9) - fixe E) 


0 -o 


- ~ A) fix (E) /E-dE ¥ inf’ (0) — M.x-/2) \t|—itdax/b, (4-3) 


where 


Dix = (27/6) fix (0) = (2z/h) >>> > | Puara(w) |? 


Al w 
x { (ie 1) 0 (x% =&% K+ Fe ee 
+ Nye (é, Yom Ey, A+ +b yr) } > (4 ? 4) 


dye =— PI f(B)/E- B= 31D [Pyaar 0) F 
pz 1 w 
P KP 
+ 


(4-5) 


X 4 (yp 1 eee = 
( A ) €,7¢0 — ©), Rew Ome ry E57 — ©), Kew + OO pe | 


are the probability (per unit time) of the scattering, and the self-energy, of the (AK)- 
exciton due to lattice vibration. O(E) is Dirac’s delta function and «/ means the 
ptincipal value. Owing to |t|>c,, the first two terms of (4-3) are small compared 
with the remaining terms; we therefore neglect the former for the present. 

For the ultraviolet and visible regions of radiation, the photon wave number K is 
much smaller (~10° cm™') than the reciprocal lattice vector (~10° an™'), so that we 
can put K=O in the above. 

If we take into account only the diagonal term 4=4’ in (3-15), and approximate 
as 


U,(t3 Ad; 0) +U, (3 ad; 0) +e 
=1+ {= (0/2) |t| — itdyo/b} +--+ exp {— (1"y9/2) |t| —itd,)/b}, (4-6) 
we have, for the contribution of 7-exciton band to the absorption coefficient (3-15), 


A, (w) = 4z Pel g,,(° (67, /2) 


3 : 4-7 
3 mryenw {bw — (Exp + Ayo) + BL'yo/2)° 32 


(4:7) shows that the absorption band is of a Lorentzian shape, with a peak at 
Exo +4y) and a half-value width H=6/",, (if I’,9<@ as is always the case). The latter 
is nothing but the broadening of (/0)-level due to lattice scattering. Each absorption 
peak / is expected to be free from the influence of other peaks if Dxa1 = |€r0— Exr0| > 
bI’,) and if the contribution of interband terms U,,(t; 44"; 0) is negligible. The latter 
effect will be shown, in § 6, to be small if b>, A. 

Of course, our approximation fails if, for values of |t| satisfying |¢|>7<,, the ex- 
ponential of (4-6) already decays to values much smaller than unity. Thus our approxi- 
mation is valid only when /’,, is so small that the condition 
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Letde2 SI (4-8) 
is satisfied, in other words, when the exciton-lattice interaction is sufficiently weak. The 
more explicit form of the condition will be given in §7. In any case, at the weak 
coupling limit, each of U,,,(t; 4; 0) (m=0, 1, 2, --:) reduces to simpler form, leading 
fo" 


x Uo, (ts 7VU8 0) => 251 (l36/ 2) |¢| —itd,,/6}"/m! 


m=0 
— O49) tow lca 2) Lib itd,,/b} . (4:9) 


Thus the approximation (4-6) proves to be exact in this limit. 
Next we investigate the effect of the first two terms of (4-3). In the first ap- 


proximation, we have, instead of (4-6), 


e 


Uy + Urt-~ {1— P| f'(E) /E dE} {1 ¥ inf’ (0)} 
x exp {— (40/2) |t| —it4,,/6}. (4-10) 
Correspondingly, the Lorentzian function (the last factor of (4-7)) is replaced by 


bL'y9/2) +2.% {bo — (Exy + 4yo) } 
Cre ye eine 2) ROE (sat (4-11) 
{bea = (Exe + Ayo) }?-+ (6D 40/2)? 


where 
Mar AF ELE dE: (4-12) 


= (2/2)f! (0). (4-13) 


The line shape of (4-11) is asymmetric: the peak position is given by bOya,= 
(Exo + Dro) +4. 4H 61", the half-value position on the high energy side by bw .4= (Ey 
+4.) + 4+.%)b6l,,, and that on the low energy side by bew_1.= (€:)+ 4.) — (4- 
%)bI',). The degree of asymmetry is thus given by 


{ (Bo 51). +bO_1p2) — WO nax} [OL 9= G . (414) 


In the first approximation as regards f(E), the half-value width is not altered** 
whereas the peak height suffers the fractional change 7,. As the total area of the /- 
peak conserves so far as we neglect the non-diagonal U(t; AK) (AAA), the compensat- 


ing change must occur in the tail part of the absorption band. 


§5. The strong coupling limit 


If one takes the approximation of narrow exciton energy bands, whose exact meaning 


» As for the proof of this theorem, see, for example, a work by van Hove.*°) 

** Strictly speaking, I’jo in the exponential function of (4:10) suffers the fractional change of the 
first order in f(E), when one takes U, into account. However, so far as the product of the peak height 
and the half-value width is concerned, as will be in §7, 4 alone is effective for the change. 
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will be explained later, it is appropriate to expand exp {i(:%—€n K+w F bse) ng Pe 
(3-19) in a power series of 7 when },=/, though it is not the case for A, #4. us 


we have 
U,(¢; 44; K)T 
t 
‘ 4 er Be B = 
hae aI} t—r) (aK —&1 K)t/F dz 
h 2 (t7) 
0 
>< Baoan t ed (ic /b) "/n1- Be ax]> (5 7 1) 
n=1 
where 


Baar Kk Si ow , (w) Pur, ar () 
» ow 


x { (hides ss 1) (€, , Ey, K+ el LO) 7 
ae Nw (€,, a ex Kew a5 BOs) *} = Bera K> 


By aar = Be aark (independent of K). 


(5-2) 


The ratios of successive coefficients, B”*” /B™, are of the order of (6/2) where 6 is the 
breadth of each exciton energy band (phonon energies bw,,,. being neglected). 
Consider the term /,=/ in (5-1): 


— Bia py 31 BE (3 (5-3) 
2h n=1 (n+2)!\ 5b 


and define 


Dj?=Byax- (5-4) 


For values of |¢|, for which the first term of (5-3) is of the order of unity, the n-th 
term is of the order of (6/2D)""'/(n+1)!. Assuming 6/2< D, which means that the 
exciton energy bands are narrow enough, or that the exciton-lattice interaction is strong 
enough (see (5:4) and (5-2)), let us approximate 


Uy(t; 44; K) +U,(t; 243 K) yex0my to 


tae D? ‘) 
— {+--+ asexp( — ae 3:5 
2h* P( 26° He 


=)]— 


After Fourier transformation, we get the absorption coefficient for the 4th band: 


A, (wo) =e Pel gel FE exp {— sat kD 
3 muyuonw yD, 2D, 


which is of a Gaussian shape with a peak at &, and a half-value width H=2y7/ 2 (In 2)D,. 
(We have replaced €, in (3-15) by &,, because D> 6/2.) 


(5-6) 


tT Here we do not take K =0, because, 


for the X-ray exciton which is one of the typical cases of 
strong coupling, K is not small. ) 
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The effects of 4,4/ terms in (5-1), as well as those of non-diagonal U,(t; 77’; 
K) (4A4/’), will be shown, in § 6, to be of secondary importance if we further assume 
that 0>D, that is, the exciton energy bands are well separated from each other. 

That we have taken out only the first term in (5-3) means that we have replaced 
H'(t) in (3-16) by H’(0). In this approximation, that is, in the limit 


6/2<D<o, (5-7) 


we can calculate all U,,,(¢; 44; K) (m=O, 1, 2, -:-) explicitly, with the result 


Sam (es M5 K) => —*—_(—* ye (ony [De 


m=0 m=0 (2m) ! b 2" my! 


ES Ds 
=exp( — te), 5-8 
P( et) ae 


as is easily confirmed by considering the possible combinations of intermediate states in 
evaluating (3-16). Thus (5-5) proves to be exact in this limit. 

That the generating function is written in a closed form (5-8) can be understood 
from another view-point. When the energy band is very narrow, the “ localized exciton ” 
is a sufficiently good eigenstate, and one can consider an adiabatic potential of lattice 
vibration for this localized exciton state. We can repeat the same mathematical manipu- 
lation as was done in the case of a trapped electron.” Thus, H=H,+H,+H,, is 
essentially the Hamiltonian for a harmonic oscillator system with equilibrium point dis- 
placed due to H,,, the motion of the electronic system being practically freezed. Thus 
exp (—iHt/b) in (3-10) can be calculated explicitly, leading to the same result as (5- 
8), provided phonon energies bw are neglected against D, as was done in deriving (5- 
8). The width D, of the absorption band is due to the difference in equilibrium posi- 


tions of lattice b2fore and after excitation. 


$6. The effect of interband interaction 


In the previous two sections we have neglectzd non-diagonal terms U,,(t; 77’; K) 
(A341), and further, in th> strong coupling case, the terms 4,7 in diagonal U,,(t; 2/3 
K). We now consider the effect of these terms on each absorption band /. 

In the weak coupling case, we proceed in the same way as in § 4. Making use of 


the formula (4:2), we have 


U,(t 34d’ 3 0) =ib (E,9— Enno) Th [ ef Cao Earo) t/b ft (1419/2) +1( 4% 0/6) } 
etic (DPy310/2) +i (4)310/6) |, (6-1) 
where + depends on the sign of ¢, and L’yir%, 4pxrx are defined as in (4-4) and 
(4:5), |fur,2(w) |? being replaced by Be, (Ww) Burya/(w). Multiplied with exp (—#€,, t/b) 


apearing in (3-15), it is easy to see that the first term in (6-1) contributes to the 
N-peak, whereas the second to the /-peak. Thus the contribution from 
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Fy, (0) e— £2048 U,(¢5 22/5 0) + Fan (0) e904 FU, (t; 12; 0) 
of (3-15) to the Apeak becomes 

—ib(€,—&,,) 714 & Re (Fux Dans) + (24/6) Re Paar aa) $ eT atlb (6-2) 
where the sufix K=O has been omitted for brevity. 


Adding (6-2) to (4-6) as a correction term, we repeat the same procedure as 
used in the end of § 4. Finally we get the change of the peak height 


m= 2 DY Re Pra Anas) /Far (Ex Ens) f (6-3) 


and the degree of asymmetry (the definition bzing given by (4-14)) 
B= (1/2) Px Re (Pyar bI’,,,) / {Fy (€, —€,,) } > (6 az 4) 


which are caused by the influence of other bands. 

The effect of interband interaction represented by 7 and .Y is small if 0,,,>4,,,, 
bI'y,,, that is, if each energy band is well separated and the exciton-phonon interaction 
is sufficiently weak. 

Let us now consider the strong coupling case. As in §5, we calculate (3-18) or 
(3-19) by expanding exp {i(&,. «—6., 7+ +6w,,,.) 7/6} im power series, and take out 
the contribution to the /-peak in the same way as is described in the above. The 
calculation being rather tedious, we only mention the result in the following. 


Out of the summation 


333) Fann (K) ere 18 UC; 2"; K), 
/ 


we take the terms which have exp(—iS,,¢/b) as a principal factor; the contribution to 


the coefficients from the non-diagonal part (4’342") is given by 


Ss (Ey 8, diet sunt 2 Re[ Fir {BS MEL G1) By"), a} 


Ato m= 


sl Sv SY 53 (€ ap Gy sie (net re (€ ane ie '2 Re (F,,,, B' xr Pera) 


“Trahiv m= 0 


Ss ee 1 oak Ges —€,)) < 2Re CP yrrae BS ard | 


Vv i, / m —(m+ + 
apps = (it/b)"/n 9 (—1) (Ey — 8) 2 Re (Fyyy BOaEE™ 


m ant 


+ So IK S) (it/b)"/n!- S) (— Wh ie 


ec Um asl fame 


Xx (€ Vea) 0m ie (6), Ey) 7 2Re (Farrar BS Ne wth (6-5) 


> 


while the contribution from the diagonal part is written as 


2 Peis ay ( “ee 1) m (m + 1) (&,, —&,) —(m+2) Bo 


A,ATAI 
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— SY Fan 3} +1) (G8) BY, 


n= 


+ SS Fy (it/b) DS ts ie (m+ 1) (&, ee ? Bey, 
wv m=0 : 


o 
ae a Fy) (it/b) 3, (€,,—&,) 7 1) Bon 


m=0 


+ DY Fara 3) (ie/b)"/n!- 33 (— 1)" FY) (Ey 8)? BE 


= F535 =) (it/b)"/n! - BY). (6-6) 


Here the primed summation means that 4’/=/ or //’=A is to be omitted. 

The last term of (6-6) has been discussed in §5. All the other terms of (6-6) 
and (6-5) represent the influence of other exciton bands. It is easy to see that these 
terms are small compared with the last term of (6-6) for values of t<b/D, (see (5- 
8)), if the condition (5-7) is satisfied. For instance, a typical term of the last line 
of (6-5) are smaller than the last term of (6-6) by the order of 


COO 2S (6/7 2D CO fan 220, sae 1) 


Thus it is reasonable to regard (5-7) as a sufficient condition for the absorption band 
to be Gaussian and for the influence of other bands to be negligible, althcugh a com- 
plete proof would require the considerations of U,,(t; 1)’ K) with n= 4. 

The constant terms in (6:5) and (6-6) give the change of the strength (area) 
of the j-peak, the terms proportional to (it/b) give the shift, and the terms with (it/b)° 
are related with the change of the width, due to the influence of other exciton bands. 


§ 7. Further discussions of the theoretical conclusions 


In this section we rewrite the conclusions of the previous three sections more ex- 
plicitly, by making use of appropriate models or approximations in calculating relevant 


quantities. 

(i) Calculation of I’, , in the effective mass approximation 

This quantity, given by (4-4), is not only related to the width of the absorption 
band in the weak coupling case, but it is also important in discussing the dynamical 
behavior of excitons in the vibrating lattice field. It consists of the intraband and inter- 
band transitions, of which we here confine ourselves to the former. 

Let us take the effective mass approximation (2-2) ~(2-9), m, and m, being assum- 
ed to be isotropic. We take the lowest exciton band, and assume that the wave function 
for the relative motion is of a hydrogenic 1s-type: $1,(L) = (a?/z)'” exp(—al). 
Putting (2-16) (2-17) and (2-18) into (4-4), we get, as the contribution of the 


acoustical mode to the 1s->1s scattering (see also ref. 11), 
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4m*v,«T 2 
eee ) a ee Cy 
I Is, 1s (ac ) orh? Mie 
(2%2c ke < Wac_ region T) > (7 ; 1) 
| (K*® + ,2) /K* (Wye < K* <a, w,—region IT), (7 -2') 


ate earT at (CE fy (14 BRAY 
= 9nh? M2 Ke 3 py; ( ae 


top POMS) |G 


| pit{1—(1 4 pk y"| tp {1 (14 pe K** y"| 


ae fay ay 


ee rg a a a) 
es ( pee far) 


ae ) 
Pr ape 


(w,,. K*—region III), (Fay 
where 


Bian == > usd ams. (7-2) 
the former being of the order of 10°cm~! while the latter, the Debye cut-off wave- 
number, of the order of 10°cm~'. (7-1)~(7-1") are plotted, as a function of 


optical 


Wy ? My — K* 
vv ~ — —_ ie ~ = 
ie ( 100 ) A 10 ) 
(a) (b) 
Fig. 2. The scattering probabilities [’y of an exciton with wave number K=K)+K*, due to 
acoustical and optical modes, as functions of K*. 


K*(=K—K,), in Fig. 2. They are valid at high temperatures T > 0,(K*), while at 
very low temperatures T<,(K*), T should be replaced by 6, (K*) 


in these formulae. 
Here the characteristic temperature 7)(K*) is given by 
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(3, +K*) /w,, (Region I), ee 

6. (K*) = (2bu/3x) X | % MF Wide 0k egion [) ( ) 
((K* + 000)9/(K*+0,2) (Region ID), (7-34) 

the expression for the region III being omitted as it is rather complicated. It is plotted 


in Fig. 3 for a typical case. It is of the order of (w/»,) 4,=bua/« when K*~». 
In the case of the optical mode we get 
1 


1 2 2y9 m™ 
K ) Z (Pa Pe) ba! 


7 reel 
Uye,7%518 (op. ) =we'( 


Ko 
F (2K¥ 4,2) K*+m,,?  (0<K*<»,,—region I’), (7-4) 
[n(2K** +2) / KY +02 + (@+1) (2K — 942) / Ke — Wy? 
(%,<K* <a, w,—region II’), (7-4') 
where 
Wop = (2m* w/b)'” Km, (745) 


the former being of the order of 10’cm~'. The 0, (K*) 
expression for the region III’: K*>~w,, has been ’ 
omitted for brevity (see also ref. 11). These 
are plotted in Fig. 2 for a typical case. 

It is to be noted that for all values of 
K* the contribution of the optical mode is 
relatively unimportant, as is already stated in § 2. 
Especially, for small values of K*, the contribu- 


tion is quite negligible compared with that of the 


acoustical mode (see Fig. 2a). Thus, a thermal 0 
exciton with K*~ (3m*xT/b’)'” is scattered by 


acoustical vibrations alone. The mean free path Fig. 3. The characteristic temperature 
@)(K*) for I) 5 47-5; (ac). 


Wace Wo 


SK* 


[ is given by 


4 m*?o«KT 


Ci Cy), 7-6 
Gan A ( ) (7-6) 


fe Mae (Eh Ree ge) 
for values of K* which are >w,, and <a, ™, being independent of K*. It is valid 
at all temperatures, except at very low temperatures of a few degree (see C2) 
Moreover, it does not depend on the wave function of the internal motion of the exciton. 

In the same way, we can calculate 4), and D,, which are related to the peak 
shift in the weak coupling case and to the half-value width in the strong coupling case, 
respectively ; however the result is rather complicated, and will not be written here. 

(ii) A simplified approximation for the overall structure of f(E) 

The effective mass approximation used in the above has two drawbacks ; firstly, it 
is valid only for the excitons near the extremum of the energy band; secondly it is 
presumably not a good approximation for the internal motion of the exciton which has 
a binding energy as large as 1~2 eV, as is the case for alkali-halide crystals. 
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As regards the latter situation, it would be more practical, at least in the present 


stage, to take into account the effect of internal motion by assuming, in (2-17), that 


( when w<79,, 
ARR es ee lo when 2% < wam, hae 
A + ~ "0? 


where (w,)~! is of the order of exciton radius. Then the calculation of D, defined 


by (5:4) and (5-2) becomes quite simple, the result is written as* 


era a) le He (7-8) 
9 Mv’ » KO, i 2 o,. 
where 
A = (3/8) buw© /K, (7-9) 
and C=C,—C,. 


As for the first problem, we note that not the local (the neighborhood of the 
bottom or the top) but rather overall structure of the exciton energy band is important 
in calculating f,(E), which was defined by (3-20) and characterizes the main feature 
of the line shape. 

If we confine ourselves to intraband effect, and moreover neglect phonon energies in 
(3-20), f,(E) is different from zero in the interval (&,—& 9, €,-—&,9), where &,, and 
€,, means the energies of the bottom and the top of the energy band. Further, it is 
easily confirmed that at the both ends of the interval f(E) tends to zero as {E— (&,, 
Ex) $1” of {(E,—EF,9) —E}". If most part of the energy band lies on the high 
energy side of €,,, it is rather plausible to suppose that the peak shift J is negative 
according to (4-5), and that the asymmetry .°Y is positive due to (4-13). 

In order to take into account these situations in a simplest way we assume the 


following functional form for f(E) : 
| » \gy IP ; ; 
fi (E) = {( 1.) -(E- ‘ pa) | (—1X<pX4+1), = (7-10) 


where 6, is the breadth of the /-energy band, and p, is a parameter representing the 
position of €,, in the whole energy band. p=-+1(—1) corresponds to the case that 
K=0 is at the bottom (top) of the band. The normalization constant s, is determined 


by considering that the integral of f,(E) is equal to D,° according to (3+20) and (5- 
2) ;. that is; 


5,=8D,"*/b,". (7-11) 


Note that a large s corresponds to a large coupling constant, to high temperatures or to 
a narrow energy band, as is easily seen from (7-8) and (7-11). In the following, we 
shall omit the suffix / so far as the intraband effect is concerned. 

Making use of (7-10) we can calculate the quantities given by (4-4), (4-5), 
(4-12) and (4-13) ; the results are as follows : 
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H=bl'=sy/1—p*6, 
t= — (sp/2)6, 
M= (5p/2)/V 1p, 
YN 


(iii) Criterion for the classification into Lorentzian and Gaussian cases 


(7-12) 
(7euL3)) 
(7-14) 
(7-15) 


The power series (5-1) is calculated, with the use of (7-10) and (7-11), as 


we —_p( 5) + ipDeo(. +) + (p+) pL) + ay 


(7-16) 


If s>1, we can neglect all but the first term in (7-16), as well as in U,, U,, 


Fig. 4. The classification into Gaussian, Lorentzian, Asymmetric and 


Intermediate cases according to values of p and s. 


Tl 


$5 
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and therefore get (5-8), which leads to a Gaussian line shape. . 
Next we introduce the ratio of the half-value width formula for the Lorentzian case 


to that for the Gaussian case : 


a Es pe Ee 1p 1 a: (7-17) 
2Y2(In2)D, In2 


According as r>1 or r<1, the dynamical region (5-5) or the stochastic region algae 
becomes more important than the other. In order that the Lorentzian case is realized, 
not only the relation r<1, but also (4-8) should be satisfied. In the present approxi- 
mation, 7,(0) defined by (4-1) is calculated as* 


7-(0) = {2|p|/C1—p*) 6/6, (7-18) 
and (4-8) is rewritten as follows : 
s|p|// 1—p =2|S| <1. (7-19) 
In Fig. 4 we divide the (|p|, 5) 


plane into several regions according to 
the above discussions. The region G 
(Gaussian) is defined by s>1, where- 
as L (Lorentzian) is determined by 
r<1 and |.%|<1/2. We denote by 
A (asymmetric) the region where 
5<1 and |.97|>1/2, and the remain- 
ing region will be called J (inter- 
mediate). 

(iv) Temperature dependence and 
the effect of lattice imperfections. 


In the above discussions we have 


considered the dynamical vibrations of 
the lattice. When there are any types 
of lattice imperfections such as dis- 
locations, vacancies, interstitial atoms 
or impurities, a static displacement of 
the lattice is brought about. The 
expectation values 6,,,,,*, Bus are 
generally not equal to zero. If, how- 


ever, the imperfections are distributed at random, as is presumably the case in actual 


0 7 —— pe 


Fig. 5. A typical temperature dependence of the half- 
value width H expected from theory. 


crystals, the linear effect of 6 almost cancels out, and we have only to replace Roo 


Five Cusco Cl ae.) by a suitable quantity which is proportional to the density of 


imperfections. Thus the effect of increasing imperfections are qualitatively the same as 


* We use t.(0) instead of Te 


(E), because the value near E=0 is the most important in using the 
6-function approximation (4-2), 
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that of raising temperatures. In the follcwing statements, which ccrcetn the timperatvre 
dependence of the line-shape, the word “ absolute temperature’ can be replaced, when 
one deals with the dependence on imperfection, by “the density of imperfections” with 
a suitable proportionality constant, without any other substantial modifications. 

As we have seen in (iii), the exciton absorption band is of a Gaussian shape when 
the exciton-lattice coupling is strong or the exciton energy band is narrow (corresponding 
to large effective mass), while it is Lorentzian in the opposite limit. If these quantities 
are in suitable ranges, s increases from a value s<1 at T=O°K to values >1 at high 
temperatures (see (7-11) and (7-8)); that is, the line shape is Lorentzian at low 
temperatures and tends to Gaussian as the temperature is raised. If &) is near the 
bottom or the top of the energy band (1—|p|<1), the line shape is strongly asymmetric 
in a rather wide-spread intermediate range of s (see Fig. 4). 

In the region L, the half-value width H is proportional to T except at low tempera- 
tures (T<0,, see (7-12), (7-11) and (7-8)), while it is proportioral to Var in 
G (see Fig. 5). 

The interband interaction becomes important when other exciton energy bands lie 
close to the band we are considering. For example, the peak shift (being proportional 
to IT) due to the intraband effect is given by (7-13), and it is negative or positive 
according as &,, is near the bottom or the top of the band, while the contribution of 


the interband effect is given by 


Ae Os) (Ho 2O) 


according to (4-5), which is negative for the lowest exciton band. In the region G, 
the contribution of intraband effect to the shift is small, whereas the interband transition 


causes a shift given by 


4,’ ~>¥ = . {By a, + Pe Pan Baan) of =), C2) 
Mm Eng Equy | Fy, 0 


if we take the terms with the lowest order in (6/0), out of the coefficients of (—it/h) 
in (6-5) and (6-6). In the same approximation the dispersion in the region G is 
given by 

D?—4,”. (7822) 


The first term of (7-22) is proportional to T while the second is to T?. It is possible 
that as temperature rises the half-value width tends to saturate or even to decrease, 
although in such regions, the peak separation becomes very poor due to D~0O. | 

As for the asymmetry in the region L, there is a contribution from the interband 
effect given by (6-4), besides the intraband effect (7-14). It is to be noted that both 


are proportional to T. If two exciton bands lie very close to each other, and if the 


interband contribution is dominant as regards .°%, both absorption peaks are expected to 
have slower decents towards the outside (Fig. 6a) or the inside (Fig. 6b) according as 


74 Y. Toyozawa 


(a) Re(Faar Paar) >0 | (b) Re(FrarTaar) <0 
Fig. 6. The interband effect on the asymmetries of adjacent peaks. 


Re (Fy3, 1,3) S0, the smaller peak being distorted more strikingly than the larger one. 

In the region L, the product of the peak height and the half-value width varies 
as 1+7, where 7 is proportional to T (at T=494,). The contribution from the imtra- 
band transition is given by (7-15) and is positive,* whereas the interband effect is given 
by (6-3). In the region G, 7 is mainly due to the interband effect. In the lowest 
order of (6/0), it is given by 


D= Sy Bye aa 7 Pip +Fans 
al (c,— Cae) Fy 
f : o— \ 
4 5¥ 2 : Re {Fyn (Buran By aa) } (7-23) 
ar (E,9—E a0) * F,, 


(see (6-5) and (6-6)), and is proportional to T. 

(v) Physical meanings of the criterion 

Let us consider the meaning of 7,(0) defined by (4-1). According to (4-2) and 
(4-3), the probability that an exciton has suffered a collision by a phonon is propor- 
tional to ¢ only if |t|><c,. 7, is considered to be a minimum duration time of collision : 
if we imagine a wave packet of an exciton which collides with a wave packet of a 
phonon and is then scattered, the collision duration time is of the order of ¢, in the 
most favorable cases (that is, for appropriate forms of the initial wave packets). Thus 
the condition (4-8) for the Lorentzian case means that the time between successive 
collisions is large compared with the collision duration. This is the well-known condition 
for the validity of the usual transport equations, and was discussed by Peierls” for the 
case of metallic conductivity, by Seitz”) for the case of semiconductors, and has recently 
been extended to general case by van Hove.” 

It ie instructive to compare the division in Fig. 4 with the “ polaron” problem.” 
The region L corresponds to the weak-coupling case, where the usual perturbation theoretical 


calculation is valid, while the region G corresponds to the strong coupling case where 


ie tee o is positive seems to be very probable, irrespective of the approximation (7-10). For, in 
(4-12), f’(E) is positive (negative) near the bottom (the top) of the band, where E is negative (positive), 
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the so-called self-trapped electron is a good approximation. The region A corresponds to 
a slow electron (note that 1— |p 


<1) in the intermediate coupling case whose exact 
treatment is rather difficult. In our case, too, it would be necessary to take into account 
the cloud effect suitably, in order to discuss the line shape for the region A more elabo- 
rately. 

The situation in the Lorentzian case is analogous to the motional or exchange 
narrowing effects of magnetic resonances.” Thus, excitation energy (& ), kinetic energy 
of an exciton (€,4-—€&,)) and exciton-lattice interaction H,, of our case correspond to 


Zeeman energy, exchange coupling and dipolar interaction, respectively, of the magnetic case. 


§ 8. Comparison with experimental results 


We now compare the theoretical conclusions stated in the previous section with 
several experimental data available at present. 

(i) Analysis of Fesefeldt’s*? and Martienssen’s” data shows that at low and room 
temperatures, the first exciton absorption bands of KI and RbBr crystals are Lorentzian 
in the main, while it approaches the Gaussian shape at high temperatures, as is expected 
from theory (see §7 (iv)). Other alkali-halides seem to show the same tendency, 
though not so clearly as in KI and RbBr. 

According to Tutihasi’s"? data on AgCl crystal, the absorption curve on the low 
energy side of the first peak, at T= —184°C, is in excellent agreement with a Lorentzian 
curve, with half-value width H=0.177 eV. On the high energy side, overlaps with 


other bands prevent the analysis. 


oO specimen (f) 


x specimen (e) 


specimen (b) 


Lorentzian 


Gaussian 


+6 +5 +4 <3 


— (EE ven) /H 


Fig. 7. The absorption curves for the first peak of KI (containing 10 mol % of KF) measured by 
Fischer. The peak height and the half-value width are reduced to unity for each specimen, 
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Table 1. The results of analysis of Fischer’s data on the first absorption peak of KI crystals 


containing 10 mol % of KF. 


. halfvalue | j. ez 
specimen peak position | peak height | ak H line shape asymmetry 7/7 

r Cf 5.844 Ne | 2.16 0.067 eV i: very small 0.178 
(e) 5,840 1.40 0.115 - — 0.08 0.198 
(d) DYOD 0.674 | 0.37 (L) —0.30 0.305 

(eo) 5.63 | 
(bier Se 0.467 0.42 c 0.196 
(a) | 5.48 0.434 0.40 G 0.174 


————— 


Fischer” measured, at T—=20°K, the absorption coefficients of a number of KI 
(containing 10 mol % of KF*) films with different preliminary heat treatments. The 
higher the temperature is maintained or the longer the treatment is continued, oe less 
will be the crystal imperfections. In his specimens (f) and (e), which are considered 
to be the most perfect ones, the first peak is of a Lorentzian shape with relatively small 
width, while the specimen 
(b) and (a) which must 
have high densities of im- 2 - | 


perfections show nearly ans of a | 
Gaussian shapes with large / , 

widths, as is shown in WH ; A t <! 
Fig. 7 and Tab. 1. These / / 


results again confirm the 


theoretical expectation (see 
AG), 

In Fig. 8, we plot the 
half-value widths of the 
first peaks of KI and RbBr, 


as measured by Fesefeldt, 


s  RbBr (Fesefeldt) 
@ KI (Fesefeldt) 
© KI (Martienssen) 


Fischer and Martienssen at 
various temperatures. The 
temperature dependences are 


in qualitative agreement with 


i A... ——eoeeee | 

the curve of Fig. 5, show- 100°K -200°K.~=—s«300°K.—~—SiC«AOOK.~—SC«SOOK 
ing a tendency of bending r 

downward from the linear 


Fig. 8. The observed half-value widths H of the first peaks of Rb 


curves Hoc T (extrapolated Br and KI at various temperatures (after Fesefeldt and Martienssen). 


Fischer carried out the experiments on KI containing warions amounts of KF. Here we confine our- 
selves to his specimens containing 10 mole % KF, because his data are most detailed for these, and in addi- 
tion, the absorption curves seem to be almost the same as in pure KI specimens, 
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from room temperature values), at high temperatures where the absorption bands are, 
in fact, Gaussian. The reason for the discrepancy between Fesefeldt’s and Martienssen’s 


data is not clear, but it might be due to some kinds of crystal imperfections contained 
in the former experiment. 


(ii) We now proceed to somewhat detailed analysis of Fischer’s data on KI. The 
results of analysis of the first absorption peak are shown in Tab. 1 and Fig 29. tipatiae 
first column of Tab. 1, the specimens are arranged in the order of increasing imperfec- 
tions. We can take the peak shift as a measure for the imperfection density because 
of (7-13), (7-20) and (7-21) (all of these quantities are proportional to T in the 
case of temperature variation), though the proportionality constant in the region L is 
different from that in the region G. In the last column the area defined by const. X 
(peak height) X (half-value width) are shown. In the L region, the constant is taken 
to be 1/zIn2 times of that in the G region (compare (4-7) and (5-6)). 

In the Lorentzian region, H, .°%, 7 and shift are expected to be proportional to 
each other according to (iv) of §7. This relation is approximately satisfied for the 
specimens (f), (e) and (d), as is seen from Fig. 9. Making use of the values of H 
for (f£) and (e), the area for the ideal crystal (which means that H=0, and is physical- 
ly not attainable because of zero-point vibration) is extrapolated to be 0.150. For the 
specimen (e), therefore, we have 
7 = (0.198 — 0.150) /0.150 = 0.3 | 
0.32, while .97=—0.08. Sub- | cox 
stituting these in (7-14) and 
(7.15) we have, p=-—0.45, 0.2 5 % —0.4 7 
and further comparing (7-12) byameicy + | 
with the experimental value 
for (e), we get b=0.40 eV. seal — boa 
The peak shift between (f) r Boat 


one en 8 ev Ngee 0.3eV  —-0.4eV | 
ea to be 4= +0.012 eV 5.85e F : ” : 


rap eend foes 
whereas the observed value is 5 g.y/ jis 
4d=—0.004 eV. This discre- 
pancy would be mainly due to fo) 
the interband effect (7-20),  5.7evyr| Peak x ss tinte) 
nd) 


yy) 


- 


axe 


specimen (f) 


which is certainly negative for 
the first exciton band. 


In the Gaussian region, 5.6eV 


> ba 


the peak separation is rather 
poor, and we have analysed 
the absorption curve only on — 5.5eV 
the low energy side ; the values 
of H for (b) and (a) were 
obtained by assuming the sym- as functions of the half-value width H (see Table 1). 


Fig. 9. The area, the asymmetry .9/ and the peak position 


72 Y. Toyozawa 


metry of the curve, though this is rather doubtful. The saturating or decreasing tendency 
of H, as the imperfection increases, 1s, however, well established from this data, and this 
is in accordance with the theoretical result (7-22). The second exciton peak of KI is 
separated from the first peak by 6~0.5 eV. We can write the shift of the first peak 


as d/=—vD?/d in view of (7-21) and (5-4), where ¥( > 0) is a dimensionless quantity 
of the order unity. Consequently we get H~2(D?— 4")'?=2{— (6/v) d’—A"}", and 
the maximum of H is to be realized at 4’(m) =—0/2¥ with H(m)=0/». From Fig. 


9 we estimate H(m) ~0.47 eV and d/(m) ~—0.20 eV, though with some ambiguity. 
The relation H(m) =—24'(m) is approximately satisfied. On the other hand we have 
D(m) =H(m) /\/ 2 ~0.33 eV, and according to (7-11), s(m) ~5, the result which is 
consistent with our assumption that this neighbourhood is well in the region G. 

(iii) Let us now compare the observed and calculated half-value widths. According 
to Martienssen’s data, the first peak of KI shifts to low energy side by 0.10 eV and 
the half-value width increases by 0.10 eV as the temperature is raised from 155°K to 
293°K. According to Fischer’s experiment (Tab. 1), the ratio of the peak shift to the 
increment of H is 1/12 for (f)—(e) and 1/6 for (f£)—(d). Thus the most part 
(~0.09 eV) of the temperature shift of the peak is considered to be due to thermal 
expansion, the remaining part (~0.01 eV) being due to the dynamical lattice vibrations 
(self-energy). Making use of thermal expansion coefficient (linear) 0.4-10~* deg’, we 
estimate the deformation potential: E,=—0.09 eV/(3X0.4X10-*X 138) =—5.4 eV, 
C= — (3/2) XE,=8.1 eV. If we take the effective mass approximation, this C cor- 
responds to C,—C,, because the exciton energy band shifts by E,.J—E,,4 when the 
conduction band shifts by E,.4 as a whole and the valence band by E,,J (Here J 
means the dilation). 

In this connection it is to be noted that Kawamura ™ has calculated the deformation 
potential for the conduction band of KCI crystal by the cellular method with result 
Ee=—5.4 eV. It would presumably be smaller for KI crystal. On the other hand, 
Ew would be of the order of +Me?/3a=:2.4 eV for KI (M: Madelung constant, a: 
lattice constant) because each of valence band electrons is considered to be fairly localized 
on a halogen ion. Thus the value E,=—5.4 eV can be reasonably explained. 

Comparing (7-12), (7-11) and (7-8) with Martienssen’s data H=0.20eV at 
T=293°K, and inserting C=8.1 eV, Mu®/u=cy=0.25-10" dyne cm* (we follow the 
procedure due to Bardeen and Shockley’ in evaluating ¢,; as for the elastic constants 
see ref. (29)), 6=0.40 eV and p= —0.45, we get w/w, =0.62. This means that the 
radius of the exciton is slightly larger than the lattice constant, which seems plausible 
in the case of alkali-halides. 

According to (7-8), the low temperature value of H should be (,/T) times the 
value at a high temperature T (still in the L region). Comparing Fischer’s data H= 
0.067 eV (the specimen (f) at 20°K) with Martienssen’s data at T=293°K stated 


above, we get 6,=99°K. On the other hand, the Debye temperature O,=bum,/« is 
estimated to be 190°K. Now, 9, must be smaller than @ 


p/2 according to the general 
theory. 


If we use (7-9) , is estimated to be 40°K. Although this estimation has 
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only qualitative meaning, the observed H at 20°K seems too large to be explained by 
the zero-point lattice vibration alone. This discrepancy is presumably due to a small 
amount of imperfections frozen in, in an inevitable way, on cooling the crystal, the 
fractional contribution of which is larger for lower temperatures. We must also add that, 
of Fischer’s data, we are analysing only those specimens which contains 10 mol % of 
KF. 

(iv) In the above, we have neglected the interband effect (except in the Gaussian 
case). This effect is presumably not small, as is pointed out in (ii) in connection with 
the peak shift, and consequently the estimated values of 6 and p are of only qualitative 
meaning. Moreover, the true width of the energy band might be larger than 6, because 
the latter is the width of that part of the energy band which is connected with the 
point K=0 through the transition coefficient §(K) of appreciable magnitude. 6 may 
be much smaller than the true width of the energy band if the exciton radius is large, 
while they do not differ very much for the case of alkali-halides. 

On the other hand, the band bottom €&),, is estimated, according to (ii), to lie 
(1—p)6/2=0.29 eV below &,. This is consistent with the observed value 0.32 eV 
for the energy difference between the first peak and the f-absorption peak.” The latter 
corresponds to the creation of an exciton which is trapped in the neighbourhood of the 
F-center, and the required energy must be smaller than &,, by a trapping energy. 


Here we should like to refer also to Howland’s*” 


calculation on the energy band 
structure of KCl, according to which the valence band is fairly narrow (~1 eV) and 
the uppermost branch has a maximum on a (011) direction nearly midway between the 
center and the edge of the first Brillouin zone. Making use of his result, and assuming 
that the bottom of the conduction band is at K=O with effective mass equal to the 
true electron mass, the procedure described in (2:2) ~(2:9) leads to an exciton energy 
band which is fairly narrow and whose bottom is coincident with the top of the valence 
band in K-space. The situation would be qualitatively the same for KI (apart from the 
large spin-orbit splitting of the valence band”). This rather speculative consideration is 
consistent with the result of the above analysis. 

(v) According to Martienssen,” the first exciton peak of CsI is followed by the 
second and third peaks with rather small gaps, and shows, at the same time, a strong 
asymmetry (-Y%=—0.23 even at 20°K; compare with the specimen (f) "of KE in Tab. 
(1). It is very probable that this strong asymmetry is largely due to the interband 
effect. In the data of Philipp and Taft,” the temperature dependence of the separation 
between the first peak and the other two peaks is well expressed by the formula a+7T 
where @=0.21 eV and 7=6.4-107' eV/K°. These values of .°% and 7 are reasonably 
explained by (6-6) and (7-20) if we assume that the interband exciton-lattice coupling 
constant is as large as the intraband one. 

(vi) In Cu,O crystal, three hydrogenlike series of absorption lines are observed,” *” 
which are interpreted by Gross and Nikitine to be due to excitons. Each line has a 
very small width H, of the order of 107° eV at most.”? This might be partly due to 
the large radius” of the exciton which makes (wv /m)* of (7-8) fairly small, and partly 
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¢ 
die ’to ithe smallnéss of C itself (note that Hoc C*). © In ‘fact, the temperature shifts 
of the series limits”) are as small as one tenth of that of the first peak in KI, thus 
leading to a small deformation potential. 

(vii) In the case of an X-ray exciton, the hole corresponds to the innermost shell 
of an atom, and the effective mass m, is very large. In this case, the dipole-dipole 
interaction of the Heller-Marcus"” type might be more important in determining the 
effective mass m* of the exciton; nevertheless m* would be large unless the oscillator 
strength for the transition is as large as ~1. Thus we expect that the absorption peak 
corresponding to an X-ray exciton is of a Gaussian type for any temperature. 

The experimental data for KCI due to Kiyono,” as well as that due to Trischka* 
seem to support this conclusion, as has also been noted by Parratt and Jossem,™” although 
a quantitative analysis of the line shape is rather difficult in the present stage of X-ray 
spectroscopy. The observed half-value widths (2~3 eV), on the other hand, seem to 
be too large to b2 explained by the exciton lattice interaction alone. 

In conclusion, the author would like to express his sincere thanks to Profs. T. 
Matsubara, K. Tomita and R. Kubo for stimulating discussions, and to Prof. S. Kiyono 
for various information on the experimental works. He is also greatly indebted to Dr. 
F. Fischer, the University of Gottingen, for kindly sending him detailed experimental 


data which have bzen quite stimulus and useful. 
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The effective range theory is developed for electron-hydrogen scattering. The scattering length 
a, and the effective range rp, of the singlet state are determined by making use of the accurate solu- 
tion of the H- ion state, as 


ds=/7.03, 19,=3-37 (in atomic unit). 


The scattering length a, and the effective range ro, of the triplet state are calculated from the zero 
energy solution by a variation-perturbation method. The results are a,=2.34, rm,=1.29. 


§ 1. Introduction 


The scattering of an electron by a hydrogen atom is the simplest and the most 
fundamental problem among electron-atom scatterings. Apart from the direct application 
of the result to the exploration of the physical condition of celestial bodies, to the 
researches of discharge phenomena and so on, it has an important theoretical significance 
as a test of various approximations used in the calculations in more complicated collision 
problems pertaining to a complex atom. 

The method of calculation for the problem of electron-hydrogen scattering at low 
energies has been improved by many authors.') The Born and distorted wave approxima- 
tion in a central field of a hydrogen atom gave a rather poor result, for one finds the 
same cross-section for both the singlet and the triplet states, and also the zero phase- 
shift for the zero energy limit. The solution of the Hartree-Fock type equation including 
the exchange effect was obtained accurately by a numerical method.) Subsequent calcu- 
lations” taking the polarization into account showed that the Hartree-Fock approximation 
gives a fairly good result, especially for the triplet state, but a more refined calculation 
is needed for the singlet state. 

A powerful method for treating a low energy scattering, so-called the effective range 
theory, was put forward’ by Schwinger and Bethe for the nucleon-nucleon scattering, 
according to which the scattering process is described completely by two quantities 
the scattering length a and the effective range 7. These quantities are calculated from 
the wave-function of the state for the zero energy of the incident particle, or by the 


function of the bound state if the system has a stable bound state with small binding 
energy such as the deuteron-state for two nucleons. 


In the case of electron-hydrogen 
scattering, there also exists a H™ state with a very small affinity, and its state function 


is calculated quite accurately by the variational method due to Hylleraas.” We shall 
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find a,, ™, for the singlet state using this function ¢/, (actually using Hart-Herzberg 
function) of the ion state. The values of a,, Yy; obtained are estimated to be very 
accurate, because the binding energy (affinity) of H~ is very small (~0.028) compared 
with the kinetic energy of the electron inside the Bohr radius, and also because of the 
reliability of the H~-function. 

For the triplet state, no such handy function exists, so that we have to construct 
the wave function of the zero energy state to find a,, 7. This function is calculated 
together with that for the singlet state by a variation-perturbation method which is, in 
a sense, a mixed use of both the variational method for Hartree-Fock and the second 
Born approximation, and_ provides improvements on them. 

§ 2 recapitulates the results of calculations hitherto obtained, from the standpoint of 
the effective range theory, and gives the values of a,, 7, determined from the function 
of the negative ion state. In § 3 the variation-perturbation method is described to find 
a and r, from the solution of the zero energy state. §4 contains the summary and the 


conclusion. 


§2. Effective range theory 


The Schrédinger equation for the system of a hydrogen atom and an electron is, 


in atomic units, 


Ai Spe pe *_+E} (ry, 7,) =0. (1) 
a az te iis Tho 


The atomic electron in a hydrogen atom in the ground state is described by a function 


é(r) which satisfies 


{2 +4 -+}4@=0, 
V2 r 2 (2) 


|scidr=1, 


so that 
d(7r) =e"//R. ° 
We shall normalize the function ¢/(r,, 1.) as 


sin (hry +05.) 


h(r,,%>) 20 (4) — < for 1,—> 00 
Hor 7) 96) OE 
> +4(%) AC aie p9D for 1, ©, (3) 
i r, Sin Ox. 


where -+ correspond to the singlet and the triplet states respectively. The total cross- 


section is given by 


1 5) a 470 i‘ (4) 
ee ees a a . 
Oo = (open a 4 oO ? af k’(1+cot? 04) 
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To express this cross-section in terms of the quantities arising in the effective range 
theory, the following function, which is equal to the asymptotic form of # (r,, T2), Is 


introduced : 


u(r 72) =o (ry) Sst Pe) 4 (ry OEE ed 6) 
T, Sin Ox r, sin Ox. 


Then, following the procedure taken by Bethe” in his paper, it is found that 


k2—h2) 
k, cot0,— k, cot 0, = Ooh) \ (usta — Pn Pa) dr,dr, 


0 


Es (1/27) G(k,, 0) {F (k;, oe) —27¢ (0)} 


¥F (1/27)G(k;, 0,) {F (ka, 92) —270(0)}, (6) 
where the extra terms 
Fat sin (&r+-0) A aXe sin (kr +0) 
Pk, =| 9 fame tO, De=lSO Ty dr 


arise from the exchange. 


Putting k,=0, and expanding according to the powers of k,°=k’, 
keotd= —4 “upto (7) 
a z 


is obtained, where a is the scattering length determined from the asymptotic form of 
zero-energy state 4)(r,, To) : 


uy (TF), Ts) =9(r%) (1—n/a) + 6(%) A—1,/a), (8) 
and r, is the effective range given by 


i= (1/47) (u'—9,*) dr,dr, +8 el —2/a) « (9) 


0 
The cross-section ‘is expressed as 
47a" 
SS ——— Tae ie ay 4 ° (10) 
1+ a4(a—r1)k*+ (ar)/2)°k 
For the singlet state, the cross-section can also be expressed by using the function 


of the negative ion state ¢,(r,,r)). For that, (6) is extended to the negative energy state 
by putting k,x=—iy, cot0,=—i. The result is 


k cot0= —7 + (0/2) Q? +) +0((772+8)), (11) 
where 
1 1 
pas — 55 \tedridr, (12) 


0 
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t, Cat Ug (r1, r) =0) sept) +6(r,) Spans) ’ (Gye) 


2 " 
and the cross-section is given by 


— oa 
+E +P C+E)/4)] 


(14) 


o.. 
If ~=r, (“shape-independent ” assumption) is assumed, the following relation holds : 


els el (15) 
a, eZ, 

The numerical values of a, 7, estimated from the existing data of calculations by 
various authors will be tabulated below. 

A) Massey-Moiseiwitsch. They included the exchange and performed two kinds 
of variational calculations, the one neglecting, and the other including the polarization. 
The data for k=O being not given, a and 1, are estimated from those of k=0.1, 0.2, 
and 0.3. 

B) Borowitz-Greenberg. They used 3-parameter trial functions for the zero energy 
state. The asterisk indicates the values deduced from (15) for a, obtained by the 
variational method and 7=0.236 estimated by Branscomb, and + those calculated from 
energy dependent approximate functions. 

C) Seaton. He tried various methods in solving the zero energy state. The values, 
polarization neglected, are deduced from the accurate numerical integration including the 
exchange, and, polarization included, from the 3-parameter trial function. 

D) We solved the zero energy equation by a method described in the next section. 
r, is calculated by (9). 

E) We also calculated p by (12), using the 20-parameter function of Hart- 
Herzberg,” which is normalized according to (13) with ;=0.236, and 


pH=3.37=% 
as 0s at Tor 
. 7.6 4.5 2.34 | 13, polarization neglected 
“ 7.08 2.34 2.34 1.12 polarization included 
; 8 16 4.047 Pie \ay) | 0.8087 polarization neglected 
B ‘ Sty 
7.74 3.85* polarization included 
oo 8.058 le ee 2.347 polarization neglected 
ie 7.01 233 polarization included 
9.02 2.350 1.29 first approximation 
i 7.88 2.346 | second approximation 
E 7.03 3.37 
pn a IE TETSU 
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is found. This value of may be fairly accurate, in view of the reliability of the H~ 


function. Then (15) gives 
1 oe 


We shall mention the applicability of the effective range theory to the electron- 
hydrogen scattering. To ensure the validity of the approximation assumed here, the 
contribution from the third term of (7) (or (11)) must be estimated. However, we feel 
that this approximation seems to be valid when the incident energy (and also the binding 
energy) of the electron is smaller than the kinetic energy of the electron inside the Bohr 


radius. If this is the case, the result: a,=7.03 will be very accurate, with an error 


possibly within a few percents. 


§ 3. Variation-perturbation method 


As has been remarked in the introduction, our problem of three bodies with two 
electrons and an infinitely heavy nucleus is the simplest of the many-body problems in 
quantum mechanics. But we have not on hand even a full knowledge of the analytical 
property of the state function at the singularities. The most effective approach practicable 
at the present stage would be the variational method. Indeed, for the H™ ion, very 
accurate calculations” have been performed with a large number of parameters. On the 
other hand, in the scattering problem, only three parameters have been employed,” among 
which only one is responsible for the polarization. It would be worth while to look into 
the effect of polarization in a more systematic way. 

The second Born approximation is certainly one of the powerful methods to see the 
general tendency of this effect.” However, for the low energy electron, this is not a 
convenient method to give a good approximation. Here an attempt is given to solve 
the problem by a mixed use of the Born approximation and the variational method, which 
is also equivalent to finding F,, approximately, when the exact solution is expressed in a 


form” 
(rts) =(St |) BCDP) + bE}. 


The general idea has already been put forward in a paper by one of the authors (T. 
O.). Here the procedure is described briefly, with a special attention to the degree of 
approximation. 


Now the problem is to solve the equation 


Ay,.(r,, r,) =E, $e(T1, Pe), 
B,=—-—4+—, Ha-S_4&_ 1 _ 1 (16) 


where ¢/, is normalized as (¢,, Mn) =O (k—k’), the inner product (¢, ¢) being defined 
as || 9(r,,r,) ¢(r,,r,)dr,dr,. Let an approximate solution in the Hartree-Fock approxi- 


mation be 
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Poo (T1, Te) =o (71) f(r) = (To) f(r;), dy (r) Sern (17) 
where f(r) is taken, with two adjustable parameters a and 6, as 
iG) Mle) 4 ser” )of tekibeat—nfa} (18) 


Let the other approximate functions to constitute a complete system corresponding to 


excited states of the hydrogen atom 4, be 


1 i 9 i 
On ar 19 (1) sin kr, a5 d,, (7) zai > (19) 
ve ip) 


" 


to which the ion state @, will also be included. Then we find 


oo 


2 (Drz (H+ 3) Do o) 
by o= ae ives ee dk 
$0,0= ¥o,0 >| Ea sali Pn nd 
+ (2/7) (9, (H+1/2) Go,0) Gp +O (Cfo 0, 8:4), (20) 
where Ey) ,=—1/2, E,,=k/2—1/2(n+1)*. The order of the errors is estimated as 


follows. Let the error of 9,, be 0¢;,~4, which cannot be very small for the low 
energy collision, and 04, is expected to be less than 4. 9, with two adjustable para- 
meters, may have smaller error € compared with 4. Thus, taking 


F \ 0.=o,0 ae yA \ 


0 


.(H— 0.0 0 
Par HT E9) G0) oa 
ok Baie j 


Py o=Po0+O(E- 4) (21) 


as the trial function to be put into the variational expression, the scattering length a is 


then calculated from 
—1/a=—1/a,+ (1/47) (¢,, (H—Eoo) G1) +O (4¢,- 4¢,) (22) 


where a, is the scattering length determined by the asymptotic behavior of the trial 
function ¢,. The approximate value of a is determined by taking ¢,=%,=p, (first 
approx.), %,= Fo, Y2=Y0 (second approx.), Y,=%,=Vo (third approx.), for which 
1/a will contain the errors &, €4, €4? respectively. (In the Born approximation the 
unperturbed functions are used for both y, and ¢,). The second approximation in the Table 
is calculated retaining only n=O (ground state) and 1 (2s of hydrogen atom) in the 
summation. The parameters in (18) are determined by the Hulthén variational method 


as d,=9.02, b,= —0.396, 4,=2.350, 5,=0.312. 


§ 4. Summary 


a) Singlet state. The effective range theory is developed for e—H scattering. 
According to the theory, the effective range has been calculated by making use of the 
H--ion wave function, which was obtained by Hart and Herzberg very accurately, by 


the variational method. The effective range is found to be %,=3.37. Using the binding 
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energy (affinity) of H~ 0.0278 which was obtained by Midtdal and Hylleraas with 
the variational method, the scattering length is found to be ¢,=7.03. The values of 
4,, os obtained are estimated to be very accurate, the errors will be of the order of 
a few percents, because of the reliability of the H--function and also because the binding 
energy (affinity) of H™ is very small (~0.028) compared with the kinetic energy (~1) 
of the electron inside the Bohr radius (namely, the validity of the effective range theory 
in the very low energy region). It is to be noted that this agrees well with Seaton’s 
value a,=7.01, (which has an error of within 10% according to the author’s estimate). 
On the other hand, the value 4,=7.88 determined from the zero energy function cannot 
be much relied upon, since the effect from higher excited states may not be negligible. 
b) Triplet state. In this state, the effect of polarization is automatically taken 
into account to some extent by the anti-symmetrization of the state function, so that the first 
approximation 4,== 2.350, %—=1.29 may not be a bad value. The accurate value of a, 
in this approximation, according to Seaton, is 2.347. By including the 2s state only as 
the excited state, 4,=2.346 is obtained, whereas Seaton obtained ¢,=2.33 in his calcula- 


tion including the polarization. 
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Scattering of antiproton by nucleon is calculated assuming “nucleonic charge independence”, in 
which nucleon and antinucleon are assumed to correspond to the third component of an angular 
momentum in the charge space and its conservation is assumed. It is shown that according to this 
assumption, the elastic and the charge exchange parts of the antinucleon-nucleon scattering are related 
to the nucleon-nucleon scattering in a simple way, just as neutron-proton scattering is related to the 
proton-proton scattering by charge independence. Explicit results are given for the total cross sections 
and the angular distributions of the elastic and the charge exchange parts of (pp) scattering and for 
the elastic part of (fn) scattering. As far as the comparison with experiment is possible, the result 
predicted by the theory is not unreasonale. 


§1. Introduction 


In a previous paper,’ we proposed a scheme in which nucleon and antinucleon were 
assumed to correspond to the third component of an angular momentum in the charge 
space. It was shown that if pion-nucleon interaction was assumed as invariant under 
rotations in [ and K-spaces, it could not be local, especially the part responsible for the 
creation and the annihilation of nucleon pairs. It was conjectured that this non-locality 
might be responsible for the large annihilation cross section. 

In addition to this, it is expected that, if the conservation of K-—which we shall 
call nucleonic charge independence hereafter —is assumed, the antinucleon-nucleon scatter- 
ing might be related to the nucleon-nucleon scattering in a simple way just as neutron- 
proton scattering is related to the proton-proton scattering by the charge independence. 
Therefore, it is expected that by examining this relation, we could check the validity 
of the assumption of nucleonic charge independence, or conversely, we could find a new 
method of approach to the problem of antinucleon-nucleon scattering. The purpose of 


this paper is to discuss these points in some details. 


§2. Seattering of antiprotoa by aucleon 


According to I, proton, neutron, antiproton and antineutron are specified by [, and 


K, as follows : 
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L, K, 
p 1/2 1/2 
P Sah —1/2 (2-1) 
n —1/2 1/2 
n NW —1/2 


Therefore, for a system composed of a nucleon and an antinucleon, there can exist 
four states corresponding to the triplet and singlet states of J and K. In the following 


we shall calculate the scattering of such system assuming the conservation of I and K.* 


1) (pp) scattering 


Let us introduce the following notations : 
I, I,o [;-, : triplet eigenfunction for I 
is : singlet eigenfunction for I 
K,,, K,o, K,-, : triplet eigenfunction for K 
K, : singlet eigenfunction for K. 
In terms of these eigenfunctions, free (pp) system can be written as 
(1/2) o—h) (Kio— Ky) e™* x (5s, m,), (2-2) 


where 7(s, m,) is the usual spin eigenfunction. (2-2) must be antisymmetrized accord- 


ing to the Pauli principle. After antisymmetrization we get 
(1/\/2) (Ly) Ky+ I, Ky) Da ae Q } 2) (Ky <5 TK) Ps, (2 r 3) 


where ¢/, and qx are antisymmetric and symmetric parts of the spatial part of (2.2), 
and are explicitly given by** 


ity Bre oe pal i 4 es) ere 

UT ae Sa DY (1) *./21+-3 

yr J=0 M=-sl=\J-s| 2 
. —t(kr—ix /2) itr— rr > 

X Go (JM > Cin) fer in --. tn) GE (2-4) 


and 


it Sit oo > J+s L— { — its , 
Picea >} aheek: Cost) nee y 2l+1 
r Jo) Ma-—Jl—| J—3| 


ae (JM ; Om,) {ea *ar-tni2) — et(tr-tnid) yx. { 
OymM : : ‘ 
where ed is the spin angle wave function 


i s 
Te. ope 21 C,, (JM ; m, m,) Yim, (Ge) x (s, m,) b (2 . 5) 


i 


* 1 . eae “ 
Calculations in this section are made along the scheme given by Blatt and Biedenharn®. As for 
the details of the calculation not described here, reader should refer to their paper 
OK We . . : 
e assumed that the interaction hamiltonian is ic wi 
symmetric with res 7 
(¢,, +o)" is a constant of i d i : iy Gea Coe 
pt or nt of motion, and the triplet and the singlet scattering can be treated separately. 
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and C,,(JM ; m,m,) is the Clebsch-Gordan coefficient. 

In the presence of the interaction, outgoing parts of (2-3) are transformed by the 
S-matrix. Since we have assumed that I and K were conserved, S-matrix is diagonal in 
I and K. We write the matrix element correspoding to the transition from / to I’ in 
the state with the total angular momentum J as 


S7(LK bills 5), 


and the outgoing wave as 


ky aa “a <a | M. ‘ / i(kr—lx[2) O4M 
a. Di Pl > > SG) ; 0 m,)ty/ 2 ar bse 4 ins 
r J=0 M=—J l=|J—s| U=|7—s| 


x ml Ep lee Das ij ko eS(0O es) BC 
(WZ 2 


= 1 {$7 (011s 5) Ky +S7(10ll; 5) 1K} i, (2-6) 


Here terms in { } can be written as 


S717; 5) TK +S” (O0l'l; 5) HK, 
= (1/2) {871111 s) + 870011; 5)} (Ty Ky + KK,) 
+ (1/2) {87 (1101; s) —S7(O0ll s)} (LK 1K)» 


and 
S7(0101s s) I,K +S" (100 ; 8) LK, 
= (1/2) {S7(OWL ss) +57 (100 sus) }.U,Kin + LoKe) (227) 
=F (agai 482 (lll; 5), = S21 001361) (Kg LK): 
Here 
IK t+Ko= (p P) + (? p); 
TpK—IK,= (n 2) + (in), (2-8) 
[Ky + LoKo= (p P) — (P p), 
and 


LK — IK) = — (nn) +(n n), 
where (p p) ((nn)) and (pp) ((nn)) respectively denote the states where the first 


particle is antiproton (antineutron) and the second particle is paotee Cagneron) and the 
first particle is proton (neutron) and the second particle is antiproton OC asee>): 
Therefore, the first and the second terms of each expressions of a(2 a) Rresbecuvely cor- 
respond to the elastic (P+p—>p+p) and the charge exchange (p+p—n-+n) scattering. 

In the following we shall discuss the elastic scattering at first. The scattered wave 
for elastic scattering is obtained by taking the difference between the parts correspond- 
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(2-6) and those in (2-3). Here it should 


ing to the outgoing waves of antiproton in 
the role of the first and the second particle 


be noticed that in terms containing (p Pp), 
must be exchanged. Thus, 4 changes the sign, and we get 


+8 


= — oo J J- JS+8 - x 
ik|/% 3) >? y p> c. (JM ; Om,) i) /al+ 1 et (er—tx]2) Ye 


J=0 M=-Jl=|J—-s| ll=|\J—8| 


He = 
r 


1 f2e CD" fa,,—4 {ar ts ) +(00!l; DY} @-9) 
V2 2 - ' 
FED {On — 218" OU) +$7(10l'l; s)] Len) — (pP)]- 

2 , 


x 


- 


Decomposing gk we can write this as 
eikr 8 F 
$i ,=it ae ee pa Ym! 7m. (Ae) 7(s, m, (2- 10) 


r Led a 


with 

Cs JS+8 JS+8 uv —T P ; 
SSS SD PP" /Y/a(2l4+1) C,.(JM; 0Om,) Cy, (JM ; p'm,’) 
—J 


l=|J—s| l/=| J—s| p/=—ll 


Gur im 0, 0) => 
See, J=0 M 


— (1/2) [S71 s) + S7(00l'l; s) }} (2-11) 


an ee 69 


poly ane oe 
1—(—1)'** ¥ . J yr, ey oe ; , 
A 1 Bae (1/2)[S7(o1l; 5) + S7(10ll; Db fe P) — (PP) Yin (O¢). 


Thus, the cross section from the initial state with magnetic quantum number m, to the 


final state with magnetic quantum number m,’ is given by 


dos, (m,’ ; ms) = nr: m (8 ¢) ['d2. (2-12) 
Total cross section for unpolarized beam can be obtained from (2-12) by summing over 
m,’ and averaging over m,. Explicitly, it is given by 


x? 8 s 
el sod a] ’ ol / 
do“, (s) oan re po do, (m, 3 m,) 
2s+1 La) 7 Cae s 


Le a _ - 
= eal we >) et > y PS arte mms ot ty 2 Jele!ls ; s@) dQ 


henatl 
25+ 1 Cr eee) ae: eae Cm | 


il —_ fits oH fats ; i 
x | eeu. De {O,2.— (1/2) [ (114,/L,; s) +5" (o0l,’l, ; 5) } * 


X {Orartg — (1/2) [SL 1ly'ly 5 8) +572 (00ls'ly 5 5) } 


eaten fits (ae la+s 
+ 2 . iv {De — (1/2) [S*(01L,'1, 5 5) +5 (101,', ; 5) }}* 


X {9 rata — (1/2) [|S (01s 5 5) +57 (101,'l, 5 5) }} (2-13) 
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1 (—1)"*8 i (aye 
Z W 

X {9r.12.— (1/2) [59 (O15 3) 0 (1011, s) }} 


il ae “its =f la+s 
++ ' ( 2 " ave a ae {9z,101 — (17/2}i9° (0111, > s) +57 (10,1, 3 s) }} i 


X {Orerta— (1/2) [S (O1Ms't, 5 5) +57 (1 0le/ly ; >| 


a 


Baty — (1/2) [5 (114 5 8) +8 (0011, 5 5) ]}* 


where 


K(J, Lis Idle s0) = 2l,+1 y 2l,+1 us ey oe) ps Sy SS >a 


m m My, M2 wil tol 


X Cas (JM, 5 Om.) Cros (JoMez 3 Om) Cons (Jr Mi ps! m3) Cinrs (JoMe 3 Pa! m,!) (2-14) 
x Yiuu™* (OG) Yoarvar (99). 
Here summation over magnetic quantum numbers can be made using Racah’s technique, 
and we get 
pha 


doz, (s) = 41 eR Be (L, 5) P, (cos 0) dQ, (21S) 


where 


BEL, )= (1/4) BHVUVYD Zhhs DZ hs st) 


+s — la+s 3 
Ref = ee ots 2) {0,,2,— (1/2) (S" (11471, ; +5" (00L/h3 5) |}* 


X {Orortg— (1/2) [S? (110, l, 5 5) +572 (001,'1, ; 5) ]} 


SEE aia (1/2)183 (01 Ad 38" C01, 59) I}* 
2 Z 


x {O:41t2— (1/2) (s7 (011,'1, 3 5) +S (10/1, 5 5) }} (2: 16) 


Ce fh, 1/2) (11 9) 4S (0011, s 1) 
Zz 2 


X {Brarta— (1/2) [S72 (Oly! ly 5 8) +578 (10l5'l, 5 5) }} 


pe DE DE a 1/2 SOL; 9.45" 10h! 5 8) ]}* 
#) 2 


X {Bryn (1/2)15% (11h 5) +5% (00K! ; 5) } | 
and Z(l,J,lJo; sL) is given by 
Z(l,Jubode 3 sL) =i-2*%/ QL +1) 2h+1) 2+ 1) @h+) 
x W (LJ loJo ; sL) Cry (LO : 00). (2: 17) 
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W (L, J, ls Jo; sL) is the Racah coefficient. 
In the following we shall neglect coupling between states with /=J-+1 in triplet states. 
Then, S’(IKI’l; 5) can be written as 
SCIKIT < s) =d,y, SEK ies), 


and introducing phase shift by 


( exp (2i0),) for K=1 
S7CLKI ; sy ; (2-18) 
' in exp(2iS),) for K=0, 
we can write BY (L, s) as 
BE (L, 5) = (1/4) SDS Zak ss DE)? 
Jy Ja ly la 
x {sin Orn sin OFnta cos (O51, — FFota) + sin oy, sin Cola cos (F542, — 5 Sota) 
+ sin OF. i sin cae cos COs S8x)) + sin 4p sin Oi cos (05,7, —* 5,1) 
+ (1/2) (1—75,2,) cos 25,2, + (1/2) (1 —75e1,) cos 25540 (2-19) 


si (1/4) (nan IE ee 1) cos 2 (Shs, aca ne == (1, 4) (Hr as 1) cos 2 (B5.2.—F 504) 
a Q 4) (75212 — 1) cos 2 OF — Fu) . 


In (2-19) we omitted suffixes for isotopic spin. They should be taken so that they 
satisfy the Pauli principle. 


As will be seen from (2-17), Z(LJ,/Jo; sL) is purely of a kinematical nature, 
and the dynamical features of the system are all contained in Re|---| in B&(L,s). We 
shall hereafter call the former the kinematical factor, and the latter the dynamical factor. 

The total cross section is given at once from (2-15) using the relation : 
ZU JiloJe 3 0) =9nr Opmy(—1) *V 2],+ 1. 
Thus 
OF (5) = (FAP /25+-1) SISIS} (2J +1) 
J i it 


> (2-20) 


ot be +s 
x 3 an 1/2)[S AUT; 8) +5” (00; 5)] 


$A d= (1/2) (SOU; 5) + 8°00; 5): } 


which, neglecting coupling in triplet states, is simplified to 
of, (5) = (78? /25+1) S15) (2J+1) {2 sin? 05, + 275, sin? &, 
J t 
—an sin’ (0),—F) + (1/4) (1-7) *}. (22%) 


Total cross section is given by summing (2:21) over s corresponding to singlet and triplet 
scattering : . 


An Attempt at Reformulating Pion-Nucleon Interaction, IT 95 
Ory— (1/4) oF, (0) + (3/4) of, (1) 
= (wH2/4) SSIS (2F-£1) (2 sin? Bty-+2 7p sin’, (2-22) 
OS ae) 


— Hi sin” (05, —F%) ate (1/4) (1 —7n) *} : 


Analogous to (2:9), the scattered wave for charge exchange scattering is given by 


ik oa © J JS+8 J+s ’ ' : 
—— > eS eS es C;, (JM ; Om,) yf 2l+1 ei ee ae 
r J=0 M=—sJl=|\J—s| ir=|7—s| 
5 e... lis if 
x >| 1) (2.1/2) (SIG AU 51s) = S57 (O0l1sS),| 
[Vee ile 2 


Seas (—1/2)[S7 (ol; s) —S”(10l'l; 8) Hl ie 


Therefore, it would be clear that the differential and the total cross section for charge 
exchange scattering are obtained by making following substitutions in the dynamical factors 
of (2.16): 


On — (1/2) [87 (1101; s) +7 (00lT; 5) J (—1/2) (S711; 5) —S7(00ll; 5) ], 
On ty 2) 1S (Ol sy eo COLT 9s) (1/2) (S7COlre sie) — S7CLOlT sy. (2°23) 
Thus, 
dorzy* (5) = (H2/25-+1) S) Bost (Ly 5) Py, (cos 4) d2, (2-24) 


with 


Brs*(s) = (1/4) SSISISISIS) ZG Shes # ZU Fuld as 5D) 


2 11 la ty la 


SCs — (=P 1+ (— 1)" 54 1, ; s) —S* (001/15 9) 
4 2. 2 


[52117 sy —S (00/1, ; 5) | 


aa te 1 c= [S*(o1l,/l, ; s) —S*(101/, 5 8) }* 
2 


x [578 (01ly'ly ; 8) —5 (10hy'ly 5 5) ] 


te a ee a, pov SRO 1 
DD Z 


x (4 (Olle > 5) — 5 (101,'l, 3 s) | 


= Mie (—1)"*" 1+ (=1)""" 5a COLT, : 5) 8 (101,/C, : 5) |* 
2) 2 


x [87 (1 1l,'l, 5 s) — 5” (001,'l, 5 s) 1} > 
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and 
Oz (5) = (FA) /2s+1) SA +1) 


x | tf" S7(1'l; s) —S/(O0l'L; 8) 2+ 
4 I=)" | sro; 8) —S7(10l'l; 8) *|. (2-25) 
Zz 


Therefore, under the same approximation as before, we get 
Bry" (L, 8s) = (1/16) Bias <b Kibhs sD) (—-" 
X {cos 2 (05,2, — Oy.) — Yn, COS 2 (5,2, — Par) (2-26) 
Fg COS 2 (FSet2— Ont) tnt Bots COS 2 (SSits—S Fate) } > 


and 


Cay = (HH 4) SIS > I+ 1) ty sin’ (O5,—€5:) + (1/4) (1—yn)*}- (2-27) 


Annihilation cross section is obtained by calculating the decrease of the outgoing flux 
relative to that for the free system. Since annihilation occurs only in channel correspond- 


ing to K=O, we have 


Onp” (5) = (7%? /25+1) (1/ 2) 21> 3 (2J+ 1) 


hes ae oe dis 
Xie aaa 2 whe cae ae |S” (00l' ; 5) i) Pg tema sans ( 2 , (On — iS’ (1001; 5) yl > 

(2-28) 
from which we have 


ony) = = (ze, ‘8) 23212) (2J+ 1) = (7%: n)*} j (2-29) 


neglecting couplings in the triplet states. 
2) (pn) scattering 
Wave function for free (pn) system satisfying the Pauli principle is given by 
Far Koy Pa—hr Ky os. (2-30) 
Thus, analogous to (2-9), the scattered wave is given by 


ihy/ 7 = x ge Jv+8 7 
Se Sty es >i Ci,(JM3 Om,)? y /214-1 etr-ta) G Mw (2-31) 


piled 
J=0 M==-/ l=|J—s| ir=|J—s| 


1 1+(—1)!*s ; Bag SS f+s < 
x a[ OM CaN D+ ATV t0y,—S7(000; 5} 


XL n)—(n p)}. 
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Therefore, calculation can be made entirely analogous to that of (pp). As for the 


elastic part, we get the result by making following substitutions in the dynamical factors : 


On — (1/2) (S711; 5) + 87 (0001; 5) | 0,,,—S7(11U7; 5), 


and 
On— (1/2) [S87 (C01; s) + S7 (101; s) ]>0,,,—S7 (1001; 5). 
Thus, 
da,” (s) =(47/2s41) > Bz. CE, 5) Px (cos 0) 42; (2-32) 
with 


Bz (L, 5) = (1/4) Sidi >> ZL Jiledos sL)Z (Ly Sila Jos 5 2) 


Ji J2 ly lo yt lol 


1 al i4+8 = loa+s 
x Ref = 2 sas 2 2 {Orn — 5 (1111, 5 5) }* 


x {Orortg— 5? (115, 5 s)} 


t= (—1) 2** i= (Gan Bay 
2, 2 


+ {O,1,—57 (101,/1, 5 5) }* 


x {Drorta —S(101,'l, ; s)} 


eee Ai+s ay gk la+s 
-itCv™ + EO (Bay, SH DY 


x {Orart2 —S(10/,"1, ; s)} (2233) 


ioe +s = l2+8 
oe AF Bauey 54 (108/13 9}* 


x {Brartg — 57 (11's 5 | , 
which is simplified, in the same approximation as before, to 
(1/4) SSS SY Zh Sibdes sD)?’ 
Jy Ja ly la 
xX [4 sin 05,1, SiN O14 COS (05,1, — Ota) + {1 — Inn COS 255.0, — Naat COS pete 1(254) 
+7 Trt, COS 2 (Fyn — Su) | + {2 —cos 205.1, — Jala COS QE Fate 
+ Hr, COS 2 (05,2, —Fsat2) — COS 20522 — Int, COS 265.1, +72, Cos 2 (O50 —F st) } | 


In (pn) system I=1. Therefore, the Pauli principle restricts possible / when s and K 
are given. >J in (2:34) means summation over those / only which are consistent with 
Z 


this restriction. 
The total cross section is given by 


Ton” (5) = (7%? /25+1) pap ay (2J+1) 
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fees 3t, oo its \ , 2 
5 ee bn — Salt; 8)? 4 2—— 8, -H QO; ) FF, 


2 2 
(2-35) 
from which we have 
6, =x" DS} Dy, 2 2)+1) {sin® +7 n sin? $5, + (1/4) (1—7y)*} (2-36) 
J i s 


neglecting coupling in the triplet states. 
The annihilation cross section is given by 


fas 
50 (6) = (@B/25+1) SIFY A= LY Gn =| Q0l; 9), (2-37) 
J &£ @& 


which gives 
Op," = (FF /4) > SY VAIJ+1) {1— Ga) *} (2-38) 
vy s 
in our approximation. 

For the latter purpose, we shall give here formula for (pm) and (pp) scattering. 
They are given by making following substitutions in corresponding formulas for (pp) and 
(pn) scatterings. 

(pn): 6,,—(1/2)[S7(il'l; s) +S’ (C000; 5) }6,,-—SH Awl; 8), 
Om— (1/2) [87 (017; s) +87 (101'l; 5) }>8,,,—S7 (011; 89), 
(pp): O2—S7 (WT; 8) >) 2[6,,—-S (aw; 5), 


8:.—S7(10'l ; s) 0. 


Thus, 
do, (s) = (F#/2s+ 1) 3B, (L, 5) P, (cos @) dQ, (2-39) 

with 

B,,, (L, 5) a > > 2 (ZC Siete: 5 L))* 

Xsin Oh, sin Dy, cos (O42, —O85s5)5 (2-40)* 

and 

Ty = KE DIT (2J+ V sin" Con, (2-41) 
and 

do, (s) = (42/2541) D3B,,, (L, 5) P,, (cos #) dQ, (2-42) 
with 


* Tn (2.40) — (2.44) couplings in triplet states ate of course neglected, 
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B,, (L, 5) —_ SAPS (Z(L,J; l, J. moe) L) Msi 


Ji J2 Z2 
> Ns 2 Ss DNs Ns 
X sin 07,7, Sin O7,;, COs (05,2, — Fiera) » (2-43) 


and 
cr api ap abs Da (2J+1) sin? 0%,, (2-44) 
which agree, of course, with results given in usual literatures. 
§ 3. Comparison with experiment 


In this section we shall discuss the results obtained in § 2 in some details. To 
make the comparison with experiment concrete, it is necessary to simplify the formulas 
obtained in § 2. In the following we shall neglect, $5, against 05, from the following 
reason. In the state corresponding to K=0, the annihilation cross section is very large, 
and moreover to reproduce the observed value (90mb at 450 Mev), the range of the 
interaction must be taken considerably large, at least of the same order of magnitude 
with the range of nuclear force. Therefore, the intensity of the incident beam will be 
strongly reduced in this region, and the phase shifts will be correspondingly reduced.* 
We also assume that 77, is independent of s and I. 

Writing down quantities comparable with experiments directly in this approximation, 


we get following results. 


* If we describe the annihilation by an imaginary potential —iM7(r), the intensity of the incident 


antiproton after traversing from 7) to r in the imaginary region is given by 
eG)? 0 expl—| WW (7) Ea) ko ar], 
TO 


where Ey is the incident energy and kp is given by ky=V ME,/#2. Thus, if we assume for W7(r) 
W(r) =W  e-'7!®/(r/a), 
we get 
| (r) |? cc exp [(Wo/Ep) (koa) Ei(—1/a)], 
here Ei(—r/a) is 
E,(r/a) =—\~ 


e—!/t-dt. 
rla 
Experimental values at 450 Mev are reproduced approximately by taking a= (b/c) and W>=70 Mev. Subs- 


tituting these values, 
|p (r) |? oc exp [0.9 E;(—1/a)]. 


From this expression we can estimate that at about 450 Mev |¢(r) \2 is reduced by about factor 4, 2, 
1.5-++ at r corresponding to the impact parameter for s, p, d-:-waves respectively. Therefore if antinucleon- 
nucleon potential is not so much different from the nucleon-nucleon potential, and if it is not so strong as 
compared with E), and the computational method such as the Born approximation can be used, we may 
expect that the &*,y,’s are smaller than 6*7;’s by about factor 4, 2, 1.5---. Unfortunately, this conclusion 
is sensitive to the form of W/(r), and if we assume shallow and long ranged form for W(r) instead of 


singular one such as the Yukawa type such reduction cannot be expected, 
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1. (pp) scattering 
i) opf'= (1/2) 0,,+ (44°/4) SAITD G — 93)? — (2/4) SAI +1) gs st! Ons 


(3-1) 

ii) ops" = (#H/4) J+) (1—9,)?+ (#2°/4) SIH 1) gs sist On, (3-2) 

fit) os, nh) 2) a2) +1) ae are 
J 


iv) Bee CL, s) = (1/4) B,,, CG s) ap (1/4) | 2 (Z(L,J, l, Jo 34 L))* 


x {(1/4) d—92) A—an) + G/2) A— 75) sin® Ors 
+ (1/2) (1—7,,)sin® 0%,,,}, (3-4) 
vy) Beit (L, J = 0/4) 3) Zi hhs sD)(—) 
Jy Jelrle 
x {sin Orn sin O10 cos (0%, — 9557) 


= (1/4) (1 Fy) (1 — 71) a (1/2) ie! —77,) sin” O5—t2— qd ‘'2) qa — 77) sin® On} : 


(3-5) 
2. (pn) scattering 
i) 95," = (1/2) 0,,+ 2/4) > 2J+) 24+(-)9 G—w)s (3-6) 
Hi) op, = (8/4) QJ 41) 2+ (—-)) AF), (3-7) 
fii) Bay" (L, 3) =(1/2)B,,(L, ) +) SY (RL Jiledes sD)? 
X (1/4) (1—92,) (1 —na) + (1/2) (1-7) sin? 05, 
+ (1/2) —7,,) sin® Osi} : (3-8) 


One feature of these results is the appearance of a factor 1/2 in the expression for 
, which is due to the fact that under the assumption of nucleonic charge independence, 
the part contributing to the annihilation is only one half of the incident flux corres- 
ponding to K=0. Therefore, we must take the range of annihilation larger than 
usually assumed to get observed values.* This means that partial waves higher than that 
usually assumed can contribute to the annihilation, and therefore the scattering angle of 
shadow scattering becomes correspondingly smaller. 


Another feature is the appearance of o.(o.) and B,,, (L, s) (B,,,(Z, 5)) in the ex- 


pp 
Pression for o5,"(o5,°) and B;,"'(L, s) (Bz, (L, s)), which is due to the presence of 


om 


* If we oer 
» for JZR/A 

ink 
0 for J>R/A, 


a” is given by 
ot (1/2) zR?(1—7?). 


In this approximation, therefore, we must take R V2 times larger than usually assumed, 
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States corresponding to K=1. As expected before, this enables us to relate some parts 
of antinucleon-nucleon scattering to nucleon-nucleon scattering, and it is expected that these 
relations can be used to check the validity of our assum ption. 

Let us begin with the angular distribution of elastic ( pp) scattering. Although 
direct measurement has not yet been made, we can draw same conclusions on the angular 
distribution from data on the attenuation cross section measured under different conditions. 
Data now available is as follows : 


Table I 
= . . . 
Author Energy Cut off angle | Total cross section Apel 
I) Chamberlain | 457 MeV 14° 105++8mb 87+7mb 
et al *) 
Tl) on 3) 457 MeV 20° 102+8mb 87+7mb 
III) Piccioni et al » | 500 MeV ee 97 +4mb 


The difference between the total cross section and the annihilation cross section gives 
the sum of elastic scattering (p+p—>p+p) outer than the cut off angle and the exchange 
scattering (p+p—n-+n). Therefore, the differences between I) and III), II) and III) 
and I) and II) give elastic scattering cross section between 3° and 14°, 3° and 20° 
and 14° and 20° respectively. Unfortunately, the error is very large at present. We 
may say as a very crude feature, however, that the forward scattering is relatively small, 
and the main part might be backward scattering. There is some difficulty to understand 
this feature from the usual theory, since if we assume that the real part of the phase shift 
is small by the same reason as we neglected o5, against 0%,, we must assume that the 
elastic scattering is mainly shadow scattering. If nucleonic charge independence is assumed, 
the scattered amplitude becomes the superposition of (pn) scattering and shadow scatter- 
ing as seen from (3-4). In the former, the main part is the backward scattering. 
Therefore, if the shadow scattering can be made small, the main part of the elastic 
scattering can be backward scattering. This is made possible by assuming a large and 
rather transparent annihilating region. 

Next we shall discuss the total cross section. From (3:1) and (3-2) we get 


Cm +o = (1/2) oy eo", (3 : 9) 


where 


o*= (nt/2) S\(2J+1) (1— 7)" (3-10) 


is the shadow scattering cross section. At present we cannot say anything definite about 
it except that it is small. Therefore we shall tentatively neglect it and try to compare 


Opp tp (1/2) oop, (3-11) 


& an 


. . Z 2 . . uh tot a 
with experiment. 05, + p,°" is given by o5» Trp 


At about 500 MeV, necessary data are given in Table I. Using these results, we get 


a) From Chamberlain’s data : 
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O52! +050” =15+10.mb. 
Therefore, the agreement is satisfactory since (1/2) o,,—16 mb at this energy. In this 


case, the cut off angle is large (14° and 20°). So that his total cross section does 


not include shadow scattering, and the error due to the neglecting of the second term of 
(3-9) is (1/2)o* (coming from the charge exchange scattering). We may assume that 
this is smaller than about 10 mb. 

b) From Piccioni’s total cross section and Chamberlain’s annihilation cross section : 


2668 10 mb. 


: up Tap 
Therefore, agreement is not so good in this case. It should be noted, however, that 
this result is obtained by combining data from entirely different experiments and so there 
can exist a difference in the normalization of the cross sections. 

A more definite relation exists between the charge exchange scattering cross section 
in the forward direction and the neutron-proton scattering cross section. As will be seen 
from (3-5), the angular distribution coefficient for the charge exchange scattering con- 
tains an extra factor (—1)’'*’”. Therefore, if we replace 6 by =—@ in (2-24), we 


get altogether (—1)”*"*”, which is always +1 since there exists a restriction 
L+l,+l,=even 
in the summation over /, and /, in (3-5). Therefore, if we write do;,°°(4#) as 


doz_°* (0) = (#*/4) > BYzy*(L, 8) P (7-0) 42, CM} 


Vp Pp 
BY,’ (L, 5) is given by 
Boy (L, s) = (1/4) = (ZL SiloJo; sL))* {sin O5,), sin 5... cos (O85, — Paro) 
‘ Jy Jalil 3 
(3-13) 
es (1/4) (1 —7n) @ ee ia, (1/2) (1 %.n) sin” Osta — e! 2) (1 = yo) Sin® O54} : 
The first and the second term of (3-13) are the angular distribution coefficients 


for (pn) and shadow scattering respectively, and the third term is their interference 


term. Therefore, the angular distribution for the charge exchange scattering is given by 


doz," (0) = (1/4) do, (2-0) + (1/2) do*(z—A) —do™(z—A). (3-14) 


In our case, as will be seen later, do‘(x—@) collimates entirely in backward direction 
and do (z—@) is also inappreciable at angles smaller than about 120°. (See later dis- 
cussion on the angular distribution.) Therefore, the charge exchange scattering in the 
forward direction is related to ( pn) scattering quite definitely by 


do;,*' (8) = (1/4)do,, (70). (3-15) 


The cross section for the charge exchange scattering within 17° was measured by 
Button et al at 440 MeV." The result is 


Oxy” (8< 17°) =3.041.6 mb. (3-16) 
On the other hand, do, (0) /d2 was measured by Hartzler®? at 400 MeV. Integrating 


An Attempt at Reformulating Pion-Nucleon Interaction, IL 103 


his result according to (3-15), we get 


ee a). 


55” (< LZ) = (2/4) aC sin 7 dé 


=1.4 mb. (3-17) 


Again the agreement is satisfactory.* 

Similar discussion can be made for (pa) scattering, too. In this case, however, 
available data are very poor. Therefore, we do not proceed further, and remark only that 
the total cross section and the angular distribution in large angles are closely related to 
(pp) scattering as will be seen from (3-6) and (3-8). 

We shall finally make a rough estimate of various cross sections assuming a simple 
square well imaginary potential to describe the interaction in the state corresponding to 
K=0. The range was taken as 2.5, 3.0 and 3.5 (b/c) (Compton wave length of 
pion), and the depth was taken so that they gave 82 mb for annihilation cross section 
at 450 MeV. The results at 450 MeV are given in Table II. 

Thus, the exp2rimental results are reproduced by taking the range ~3X (4/JUc).** 

We shall also give angular distribution for the elastic and the exchange part of (Ppp) 
scattering and for the elastic part of (pm) scattering in Fig. 1 and Fig. 2.*** As it 
has been seen, the effect to the elastic scattering from channel corresponding to K=0 


* If nucleonic charge independence is not assumed, opp’, opp** and Bpp**(L, s) are given by 
opp?? = (rA?/4) 3) (2J+1) t2n8 7. sin? 8 s2+2y% sz sin? €o% 77 
Jls 


“(l= 92 72)? — Gz) sin” ye 71—S08 70) }, 
opp?” = (x4"/4) ~ (2J+1) (8.72)? sin? (€8 177-072) » 
! Jts 
and F 
Brp®” (L, j= (1/4) PLS DES, (ZL fi lees sL))? 
Jy J2 hy la 
Xara 787222 sin €y8 xyz sin €y% 7272 cos (Ey 87171 — €1% 222) 
+48 F111 O° Foto SiN Fo% 7,2, Sin Eo% 7y2q COS (€0%.7,2, — £0" Fara) 
— 2 Fri UV Fale Sin Fo% 7,2, Sin €1% Fa2q COS (F0%.7,2,° — £1 Fala) 


=F, V2 Sale Sin 15742, Sin Fo* rotq COS (E1* 7,2, —§0° ata) }, 


where we neglected couplings in triplet states, and ¥,°,7; was assumed as equal to 7°,r,(=7*.r) for simplicity. 
(Definition of &;%,7; and 7\%,77 is formally the same as in (2-18)). Therefore, no simple relations such as 
(3-9) or (3-15) exist in this case. Calculations of antinucleon-nucleon scattering without the assumption of 
nucleonic charge independence were made by Priets”) using second order perturbation of Bs-ps meson theory, 
by Koba and Takeda®) assuming a black sphere model and by Ball and Chew assuming semi-phenomenolo- 
gical antinucleon-nucleon potential and replacing its core part by a black sphere. 

*** This range seems to be unreasonably large. This can be reduced by about factor 2 by adopting a 
suitable form for the potential. Into this point, however, we shall not proceed further in this paper. 

** Analogous to (3-14), the angular distribution for the elastic (pp) scattering is given by 


dopp** (8) = (1/4) doyn (0) + (1/2) dos (0) +do%” (0). 


(continued to the next page) 
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S+p ——- +p (450MeV) 
do" (4) (reduced to 1/10 for 9 < 15°) 
dQ 
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am me te, 
3 
is} 
2 
z 


ptp ~ nin (450MeV) 


pare) (reduced to 1/10 for @>165°) 
dQ 
1. _de,,(180°—8) 


4 dQ 


(mb/sterad) 


0 30 . 60 90 120 150 180 


Fig. 1. The elastic scattering angular distribution curve has a forward peak which is essentially the 
diffraction pattern. The first minimum lies at about 25° in C.M. system. For larger angles, it is close 
to (1/4) do,,,.(0)/d2. In the backward hemisphere, two curves can be taken to be practically identical. 
The charge exchange scattering angular distribution curve has a backward peak, which comes also 
from diffraction pattern. In the forward hemisphere it is close to (1/4) do» (180° — 6) /dQ. 


In terms of scattering amplitudes, do!” (@) can be written as 


da '” (9) = (1/8) 3) Reg?” m/s ma" (OG) °9 ma’ mg (00)) da, (A-1) 


SMs Ms 


where GEER) ait S904 (@¢) and ga? vane (0g) are given by 
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(mb/sterad) 


90 120 


Fig. 2 The elastic scattering angular distribution curve has two peaks in forward and backward 
directions. Both are essentially the diffraction patterns, and small asymmetry with respect to 90° i 
due to the admixture of (fp) scattering contributions. Between the two minima (about 30° and 150°) 
the contribution from diffraction patterns is negligible, and the curve is given approximately by 


(1/2) doy, (0) /d2. 


almost vanishes at angles larger than about 60° (corresponding to 30° in the laboratory 
Therefore, it is expected that at these angles the angular distribution for the 
(pn) scattering will agree with those for (pn) and (pp) 


system ) . 
elastic part of (pp) or 


DV x (2l+1) Cra (JMs Oms) Crs (JM mug’) A—Sis7) Ym, 09), 


™ 


CoP pals Ms (0g) =3 > >») 


Z 


ap) V 2 (21+1) Crs (JM; Oms) Crs (JMs mums’) 12,7) Yim: (09) 


m 


and = 
Go mle Ms (69) => >p > ; 


= S0m., mas’ q@ (6): 


Here q‘@ (0) falls off rapidly to zero, while q‘””,, 
ne Ms (09) by qe” in! m 
dot” (0) = (1/87) Fymng' (0) dQ. 


(0¢) varies relatively slowly in this region. Therefore, 


Then, (A-1) can be written as 
(A:2) 


Fig. 1 and Fig. 2 are written in this 


/. 
s3>™M™ 


” (Og) in (A-1). 


we can approximate q\””) ,, 


In this approximation, therefore, we can avoid the explicit use of 0° 7; 
q‘®) (0) was calculated assuming imaginary potential with range 3X (?/yc) (the second column in Table 


way. 


II). do,,,,(@) was assumed to be isotropic for all angles. 
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Table IL. 
eS 
Ramee) 40 25 | 2.0 3.5 
o’s(mb) 
cmp?! 16.5 13.6 11.9 
opp?” 82 82 82 
opp?” i Ia 10.2 9.1 
OFin® 25.0 20.3 17.4 
Opn" 82 82 a2 


respectively. specially, in the case of (Pn) scattering, coexistence of partial waves with 
different parities in channnels specied by the definite value of K is forbidden by the Pauli 
principle. Thus, the angular distribution becomes symmetrical with respect to 90°*. 
These are important conclusions from the assumption of nucleonic charge independence, 


and the validity of this assumption may be checked by examining these predictions. 


§ 4. Concluding remarks 


As pointed out in § 3, we must take the range of annihilation approximately 4 2 
times larger than usually assumed if nucleonic charge independence is assumed. In this 
point, the situation becomes worse, since this range is considered to be unreasonably large 
even in the usual theory.** It gives, however, simple relations between the elastic and 
the charge exchange parts of the antinucleon-nucleon scattering and the nucleon-nucleon 
scattering, and also definite predictions for the angular distributions. Unfortunately, these 
results are based on the assumption that ¢‘,’s are negligible as compared with 0%,’s. At 
present there is no means to check the validity of this assumption, and they cannot 
avoid corresponding ambiguities.*** Nevertheless, it would be of some interest to 
investigate them further experimentally as a trial of finding a new method of analyzing 
antinucleon-nucleon interaction. 


In conclusion, the authors wish to thank Prof. Owen Chamberlain for suggesting 
this calculation. 


Strictly speaking, contributions from channels corresponding to K=0 and 1 can interfere. But this 


effect is not so appreciable since their angular distributions are very different. 

** Levy showed that, using intermediate coupling theory, the range of the imaginary part of 
the antinucleon-nucleon potential can be considerably large, of the order of (%/nc). It should also be noted!) 
that the assumption of nucleonic charge independence leads to non-local pion-nucleon interaction automatically 

** Backward peak of the charge exchange scattering in (pp) scattering and the symmetry with bidbiecs 


to 90 in (pn) scattering is comparatively free from this ambiguity. Therefore, these features will be especially 
suitable to check our assumptions. 
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Rigidity for Octupole Type Surface 
Deformation and Shell Structure 


Shota Suekane and Wataro Watari* 


Department of Physics, Osaka City University, 
Osaka 
*Research Institute for Atomic Energy, Osaka 
City University, Osaka 


May 6, 1958 


In certain regions of elements, the even- 
even nuclei have been found to possess 
rather low lying states of odd parity. These 
states have been observed in a-decay 
spectrum and the ratio of gamma-transition 
probabilities to 0‘ and 2° states of the 
heavy elements near Z~90 and N~136.” 
Low lying odd parity levels have also been 
found in the even-even nuclei in the rare 
earth region ; e. g., the lifetime of 17> state 
of ,Sm" is a factor 150 longer than a 
single proton estimate.” This region is 
related with Z~60 and N~90. The 
systematic occurrences of these low lying 
odd parity levels suggest that they may 
represent collective oscillations of octupole 
type. Some of the qualitative properties 
of octupole type oscillations of a nuclear 
surface have already been discussed by 
Copenhagen group.” Bohr and Mottelson 
have estimated the electric dipole transitions 
and attempted to obtain information on the 


nuclear octupole distortion.” Recently, Lee 
and Inglis have calculated the 
change associated with pear-shaped defor- 


energy 


mation using the nucleon wave functions 
of a spheroidal harmonic oscillator potential.” 
They found that this energy change is 
always positive but could not examine 
shell structure dependence of the octupole 
deformation because of neglect of a spin- 
orbit coupling. 

In addition to above mentioned two 
regions, the octupole type surface deforma- 
tion of a subshell of nucleon number 40 
may also be large. The E3-transitions of 
j-forbidden type occur in neutron or proton 
number 43, 45 and 47, and are transitions 
between (7/2*) state and (1/27) state. 
This may suggest that the surface rigidity 
for octupole type deformation of this sub- 
shell is very small. According to Sano’s 
calculation, main contributions to transition 
of this type come from configuration mixing 
and contributions from collective transitions 
of octupole type have only minor effects.” 
Therefore, behavior of the surface rigidity 
for octupole type of deformation in the 
subshell of nucleon number 40 is very 
interesting for us. 

We have calculated the surface rigidity 
C,; of closed shells for an octupole type 
of surface deformation using almost the 
same method as that of Marumori, Suekane 
and Yamamoto.” In our case, the 


Moskowski’s coordinates transformation can- 
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not be used, and we introduce another 
more general coordinates transformation 
which transforms a deformed surface into 
spherical one. Of course, the results of 
calculations for C, (the rigidity for 
quadrupole type of surface deformation) 
are equal in these two transformations. 
We include a spin-orbit coupling in our 
calculation and, for simplicity, neglect the 


10° 
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effects of Coulomb interactions. 
The results of our calculations are shown 
in Fig. 1. The calculated values of C, 
are very small in the neighbourhood of 
subshells of nucleon number 32, 40, 56, 


The nucleus which has 


64, 82 and 90. 
several nucleons in the next subshells of 


these subshells may be easily deformable 
for octupole type of deformation. This 
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Fig. 1. The surface rigidities of neutron cores, Gar 
Ay= (62/MV)1/?, where Ro is the nuclear radius and Vp is the depth of the potential. 
In the regions between two closed shells, the rigidity may be reduced very much 
because of the crossing of the particle levels due to the large deformation of the 


core which is caused by coupling of the core and extra-particles, 
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shell structure dependence of C, is in 
fairly good’ agreement with the trend of 
above ‘mentioned experimental evidences 
‘except for the subshell of nucleon number 
32. Especially, the smallness of the 
rigidity of the subshell of nucleon number 
40 may suggest that the above mentioned 
explanation of j-forbidden type E3-transi- 
‘tions may be correct. According to our 
calculation, the rigidity of the subshell of 
nucleon number 32 is also very small. 
Therefore, we can expect some effect of 
the octupole type of collective oscillation 
in this region, but we have no experimental 
information. Our results for nucleon 
number larger than about 100 are not so 
. reliable, because of the lack of accurate 
knowledge about the level orderings and 
their distances of shell model to above 
In this 


calculation, we used Klinkenberg’s level 


region of this nucleon number. 
scheme up to 3d;,.—level. Smaller values 
of C,; or more pronounced shell structure 
dependent behavior are expected by the 
1j (and higher odd 
parity) levels to 2y and 1: levels and this 
may correspond to the region of N~90 
and ~136. 

The full paper of this work will be 
published in near future. Preliminary report 
of this work will be read at the second 
International Conference on the Peaceful 


perturbation from 


Uses of Atomic Energy at Geneva. In 
this report, we have not been able to find 
a strong shell structure dependence of C, 
such as Fig. 1, because of unsuitable ap- 


proximation used in the calculation. 
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Stellar Syntheses of the a-Particle 
Nuclei heavier than Ne” 


Chushiro Hayashi, Minoru Nishida, 
Noboru Ohyama and Hiroshi Tsuda 


Department of Nuclear Science, Kyoto 
University, Kyoto 


May 26, 1958 


» have discussed the 


Many _authors'~ 
nuclear reactions which can go on in 
stellar interiors up to the point at which 
the helium has been exhausted by the 
3a@—>C"™ and subsequent (a, 7) reactions. 
Beyond this stage stars contract again and 
heat up until further thermonuclear reac- 
tions begin to occur inside their cores. 
To explain the cosmical abundances of a@- 
particle nuclei, Mg™, Si*, S*, A®, and 
Ca", Burbidge, Burbidge, 
Hoyle” proposed the so-called a-process. 
The first and most important reaction in 
this process is Ne*’ (7, a) O" due to its 
small negative Q-value. Once a-particles 
haye been emitted, the nuclei heavier than 


Fowler and 
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Ne” can be built by successive (a, 7 
reactions. In very hot stars consisting of 
C”, O' and Ne’, ion-ion reactions, such 
as* GC? CG? => Ne” a and C2+4+0%>5 
Mg"*-+-a@, must also be taken into account 
as a possible @-particle source. 

The purpose of this note is to investi- 
gate -the stellar conditions, that is, the 
temperature, the density and the duration 
of time which are necessary to explain the 
observed abundances of a@-particle nuclei by 
the above process. Before doing this, 
various nuclear reaction rates were calculated 
by the well-known procedures.” In calcu- 
lating the (@, 7) reaction rates for the 
nuclei heavier than Mg™, their atomic 
num ber dependence being the most important 


for our purpose, we can use the approxi- 


Ne” (Zs a) o's 


mate cross sections given by the statistical 
theory, because in such reactions excitation 
energies are so high that the resonant 
properties of individual levels are smeared 
out. Fig. 1 gives the reaction times for 
various nuclear reactions as functions of 
temperature. 

(i) Equilibrium theory 

To explain the abundances of q@-particle 
nuclei heavier than Ne” by the equilibrium 
theory, the temperature ~10'?°K and the 
matter density ~5 X 10° g/cm? are required. 
Under this condition the nuclear reaction 


times are of the order of 10~” sec and it 
seems rather unlikely that we can find 
stellar outbursts which are rapid enough 
to assure the freezing-in of this established 


equilibrium. 


2.0 Ne®+a ON +e 3,0X10°°K 


Temperature 


Fig. 1. The solid curves refer to the reaction times t (in sec), plotted against temperature T, 
while the dotted ones refer to the mean lifetimes of a-particles for (a, 7) reactions, the density 


of target nuclei being taken as 10° g/cm’, 
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(ii) Slow syntheses in stellar cores 

The fastest ion-ion reaction, C’+C*, 
competes with Ne” (7, @) OQ, at; .the 
temperature Tess 2608 K. 4 swith » the 
density e = 10° g/cm*.”” When the central 
temperature of stars consisting of C”, ye 
and Ne” with the above density rises to 
about 6~7 X 10° °K, C”+C” reaction can 
be a main energy source of these stars 
with 100~1000 times solar luminosities. 
At this temperature, however, the atomic 
number dependence of the (a@, 7) reaction 
rates are too large to reproduce the cosmical 
abundances of a@-particle nuclei heavier 
than Si’. 

Alternatively, if the stellar cores consist 
of Ne” only,” Ne” (7, @) O” is a main 
a-particle source for the formation of the 
heavier nuclei. If Ne* (7, @) O'° and sub- 
sequent (a, 7) reactions are to be the 
energy sources of such stars, the central 
temperature near 1X10°°K is required. 
This temperature is still too low to over- 
come the above difficulty. 

(iii) Rapid syntheses in the supernovae 
explosions 

At higher temperature T=2~3 X10" 
°K, the differences among the (a, 7) 
reaction rates become so small that it will 
be possible to explain the abundances con- 
cerned in terms of the a@-process. 

At p=10'g/cm* where the (a, 7) 
reaction rates are much greater than (j, 
a) ones, the emitted a-particles are 
absorbed instantaneously by se Oe oe 
Ne”, but they are hardly used for the 
formation of nuclei heavier than Si, The 
result is not different from the case (ii). 

Our calculations, however, show that the 
abundances in question can be reproduced 
by taking the density 10° gm /em', 


where the (7, @) reaction rates are com- 


parable to (a, 7) ones. At this tempera- 
ture the reaction time for Ne” (7, a) O° 
is 10-°~1 sec, while 10°~10’ sec for the 
other (7, @) reactions, and to explain the 
observed abundances it is necessary for such 
temperature to continue for a rather short 
time, 1~10° sec, with a rapid heating-up 
Such stellar 


conditions could be realized in the super- 


and a subsequent cooling. 


nova explosions, as the time scale of 


explosions is 
4) 


implosion preceding the 
estimated to be of the order of 107’ sec 
and such high temperature will continue 
for 1~10°* sec. 
The details of this note will soon appear 
in this journal. 
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Following Gell-Mann” we shall assume 
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the global symmetry of the pion-baryon 
interactions in the iso-space (all coupling 
constants in the interactions are equal). 
We shall see here what follows about the 
hyperon (Y)-nucleon (N) 


this assumption. 


forces from 
If one also neglects mass 
differences among baryons; one obtains 
Y—WN forces due to the pion-exchange 
immediately from the knowledge about 
N—WN forces.” 


the identities 


Particularly, one sees that 


40__ ©0 
Viglen Se N)=V, (2°, N) 
SS 72 


= ABN) VGN), 


Ao 0 ‘i 
a peas N)=V,(3*, N) 
#2 


\ V 


=V,,(2", N)=V,.(p, N), 


where V(X, N) means the force between 
X and N due to the pion exchange and 
n and p stand for a neutron and a proton, 
respectively (care must be taken here with 
reference to the exclusion principle). The 
remaining part 0V(Y, N) of the Y—N 
force consists of two contributions, the one 
due to mass differences JM among baryons 
and the other caused by the K-meson 
exchange.” The former would be of the 
order of JM/M as compared to V,,(Y, N), 
M being the baryon mass. The latter is 
considered relatively unimportant because 
of either or both of the following two 
reasons; (a) All K-meson baryon interac- 
tions may be ‘moderately’ strong while 
the global pion interactions are very strong. 
(b) If one assumes the existence of a hard 
core with radius~1/3 m,, (m,, pion mass) 
for all baryon pairs, major part of K- 
meson exchange forces would be masked 


Thus one may arrive 


by this hard-core. 


at an approximate means to treat the two 
baryon problem. In the Schrédinger 
equation the empirical reduced mass is 
used and the potential V(Y, N) is ap- 
proximated by V,(Y, N) (plus a hard 
core). In this simplified situation it is 
extremely simple to survey the possibility 
of bound states. It turns out that }~—n, 
=°—p and F-—n have bound states in 
the ‘S state with binding energies ~0.06, 
~0.2 and ~0.2 Mev, respectively. All 
other cases fail to form any bound states. 
Note that two neutrons cannot form any 
bound states while the above cases can. 
This is because the Y—N system has 
larger reduced mass than the N—WN system. 
In this simplified case there are no bound 
states of 2'*—p because of the Coulomb 
repulsion. However, if 6V(S*, N) gives 
rise to an attractive force which is effec- 
tively 10% of V,(S*, N),® then a 
There 
is one example reported so far which could 
be a bound 3'*—p.” If our approximation 
V(Y, N) ~V,,(Y, N) is reasonably good 
as we have suspected above, the binding 
energies of all bound Y—WN states are very 
As is well known, the smaller the 


bound +'* —p state is now possible. 


small.” 
binding energy the larger the scattering 
length. One can expect quite an appreciable 
effect of the final state interaction between 


two baryons in the processes like 
N+N>3Y+N+K, K+D>Y+N+2, 
K+DSS4N4K. 


The energy distribution of the final mesons 
is mainly determined by the scattering 
length of the accompanying baryon pairs,” 
and gives a way to test whether our ex- 
pectation about nuclear forces is really the 


case or not. 
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In phenomenological studies of nuclear 
forces the specification of the strength and 
range of the potential in terms of the 
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well-depth parameter s and the intrinsic 
range 6, as defined by Blatt and Jackson,” 
has been used frequently. Expressions for 
Gaussian, Exponential and Yukawa wells 
are given in eqs. (4-4) and (4-5) of 
BJ. The numerical constants in those 
formulae were determined by means of 
numerical integrations of the Schrodinger 
equations for zero energy. With a fast 
computor more accurate numerical constants 
for all three types of potential have now 
been computed and are reported here. 
These new constants should replace those 
given in BJ. 

In the notations of BJ we write (W (r) 
=—(M/6)V (1) : 


Gaussian well : 

W (r) =sGb~ exp{ —K(r/b)?*] 
Exponential well : 

W (r) =sGb-* exp| —K(r/6) } 
Yukawa well : 

W (r) =sGb~* exp| —K(r/6) ](r/6) 


In Table I we give the numerical values 

of constants G and K for these three types 

of potential. We have also given (in 

brackets) the values of BJ for com parison. 
If the potentials are specified as 


Gaussian well : 

V(r) = —V, exp[ — (r/#3)*] 
Exponential well : 

V(r) = —V, exp[ —2r/8] 
Yukawa well : 


V(r) = —V, exp[—1/8]/(r/8), 


one has 


s= CMV, §*/#? and b=D8 
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with M=nucleon mass and C and D 
numerical constants depending on types of 
the potential. In Table II we have given 


the values of these constants together with 


Table I 
G K 
att 5.528703 2.059871 
(5.5296) (2.0604) 
Exponential |. 18126198 3.540787 
BO (18.1301) (3.5412) 
Yuk 3.561319 2.120060 
aN (3.5605) (2.1199) 
Table II 
G D 
oo 0.3725776 1.435225 
ates ie (0.37261) (1.4354) 
= e 0.1729151 1.770393 
cnet (0.47291) (1.7706) 
Yuk 0.5953017 2.120060 
peek (0.59531) (2.1196) 


the values of Blatt and Weisskopf” in 
brackets. 

As a check on these numbers, the values 
of K and G for the exponential well have 
well-known 


been computed from the 


analytic solution.» These values agree 
with the numerically computed values to 
8 significant figures in G and 7 significant 
figures in K. 

The writer expresses his sincere thanks 
to Professor H. Messel for his hospitality 
and to Dr. J. M. Blatt for many valuable 


suggestions and continuous encouragement 


throughout the present work. 
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